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CIp-protease as ttrget for herbicides ||tf20 RSC'Cl PCT/PTO 01 JUI<1 2006 

The present invention relates to CIp-protease, which, when absent, brings about re- 
, duced growth and chlorotic leaves as target for herbicides. For this purpose, novel nu- 
5 cleic acid sequences encompassing SEQ ID NO:3, SEQ ID NO:1 1 and SEQ ID NO: 17- 
and functional equivalents of SEQ ID NO:3, SEQ ID NO:1 1 and SEQ ID NO: 17 are 
provided. Moreover, the present invention relates to the use of CIp-protease in a 
method for identifying compounds with herbicidal or growth-regulatory activity, and to 
the use of the compounds identified by this method as herbicides or growth regulators. . 



10 

The basic principle of identifying herbicides via the inhibition of a defined target is 
known (for example US 5,187,071, WO 98/33925, WO 00/77185). In general, there is a 
great demand for the detection of enzymes which might constitute novel targets for 
herbicides. The reasons are resistance problems which occur with herbicidal active 
15 ingredients which act on known targets, and the ongoing endeavor to identify novel 

herbicidal active ingredients which are distinguished by as wide as possible a spectrum 
of action, ecological and toxicological acceptability and/or low application rates. 

In practice, the detection of novel targets entails great difficulties since the inhibition of 
20 an enzyme which forms part of a metabolic pathway frequently has no further effect on 
the growth of the plant. This may be attributed to the fact that the plant switches to al- 
ternative metabolic pathways whose existence is not known or that the inhibited en- 
zyme is not limiting for the metabolic pathway. Furthermore, plant genomes are distin- 
guished by a high degree of functional redundancy. Functionally equivalent enzymes 
25 are found more frequently in gene families in the Arabidopsis thaliana genome than in 
insects or mammals (Nature, 2000, 408(6814):796-815). This hypothesis is confirmed 
experimentally by the fact that comprehensive gene knock-out programs by T-DNA or 
transposon insertion into Arabidopsis yielded fewer manifested pheriotypes to date 
than expected (Curr. Op. Plant Biol. 4, 2001, pp.1 11-1 17). 
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It is an object of the present invention to identify novel targets which are essential for 
the growth of plants or whose inhibition leads to reduced plant growth, and to provide 
methods which are suitable for identifying herbicidally active and/or growth-regulatory 
compounds. 

We have found that this object is achieved by the use of nuclear encoded CIp-protease 
in a method for identifying herbicides. 

Further terms used in the description are now defined at this point. 

"Affinity tag": this refers to a peptide or polypeptide whose coding nucleic acid se- 
quence can be fused to the nucleic acid sequence according to the invention either 
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directly or by means of a linker, using customary cloning techniques. The affinity tag 
serves for the isolation, concentration and/or selective purification of the recombinant 
target protein by means of affinity chromatography from total cell extracts. The above- 
mentioned linker can advantageously contain a protease cleavage site (for example for 
5 thrombin or factor Xa), whereby the affinity tag can be cleaved from the target protein * 
when required. Examples of common affinity tags are the "His tag", for example from 
Qiagen, Hilden, "Strep tag", the "Myc tag" (Invitrogen, Carlsberg), the tag from New 
England Biolabs which consists of a chitin-binding domain and an inteine, the maltose- 
. binding protein (pMal) from New England Biolabs, and what is known as the CBD tag 
10 from Novagen. In this context, the affinity tag can be attached to the 5' or the 3' end of 
the coding nucleic acid sequence with the sequence encoding the target protein. 

"Activity of nuclear encoded Clp-protease": the term activity describes the ability of an 
enzyme to convert a substrate into a product. The enzymatic activity can be deter- 

15 mined in what is known as an activity assay via the increase in the product, the de- 
crease in the substrate (or starting material) or the decrease in a specific cofactor, or 
via a combination of at least two of the abovementioned parameters, as a function of a 
defined period of time. "Activity of nuclear encoded Clp-protease" describes here the 
ability of an enzyme to catalyze the hydrolysis of peptides of maximal five amino acids 

20 in vitro. 

"Expression cassette": an expression cassette contains a nucleic acid sequence ac- 
cording to the invention linked operably to at least one genetic control element, such as 
a promoter, and, advantageously, a further control element, such as a terminator. The 

25 nucleic acid sequence of the expression cassette can be for example a genomic or 
complementary DNA sequence or an RNA sequence, and their semisynthetic or fully 
synthetic analogs. These sequences can exist in linear or circular form, extrachromo- 
somally or integrated into the genome. The nucleic acid sequences in question can be 
synthesized or obtained naturally or contain a mixture of synthetic and natural DNA 

30 components, or else consist of various heterologous gene segments of various organ- 
isms. 

Artificial nucleic acid sequences are also suitable in this context as long as they make 
possible the expression, in a cell or an organism, of a polypeptide with the enzymatic 
35 activity of a nuclear encoded Clp Protease, preferably with the biological activity of a a 
nuclear encoded Clp Protease, which polypeptide is encoded by a nucleic acid se- 
quence according to the invention. For example, synthetic nucleotide sequences can 
be generated which have been optimized with regard to the codon usage of the organ- 
isms to be transformed. 



40 



All of the abovementioned nucleotide sequences can be generated from the nucleotide 
units by chemical synthesis in the manner known per se, for example by fragment con- 
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densation of individual overlapping complementary nucleotide units of the double helix. 
Oligonucleotides can be synthesized chemically for example in the manner known per 
se using the phosphoamidite method (Voet, Voet, 2nd Edition, Wiley Press New York, 
pp. 896-897). When preparing an expression cassette, various DNA fragments can be 
5 manipulated in such a way that a nucleotide sequence with the correct direction of 
reading and the correct reading frame is obtained. The nucleic acid fragments are 
linked with each other via general cloning techniques as are described, for example, in 
T. Maniatis, E.F. Fritsch and J. Sambrook, "Molecular Cloning: A Laboratory Manual", 
Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989), in T.J. Silhavy, M.L. 
10 Berman and L.W. Enquist, Experiments with Gene Fusions, Cold Spring Harbor Labo- 
ratory, Cold Spring Harbor, NY (1984) and in Ausubel, F.M. et al., "Current Protocols in 
Molecular Biology", Greene Publishing Assoc. and Wiley-lnterscience (1994). 

"Operable linkage" or "functional linkage": an operable, or functional, linkage is under- 
15 stood as meaning the sequential arrangement of regulatory sequences or genetic con- 
trol elements in such a way that each of the regulatory sequences, or each of the ge- 
netic control elements, can fulfill its intended function when the coding sequence is 
expressed. 

20 "Functional equivalents" describe, in the present context, nucleic acid sequences which 
hybridize under standard conditions with the nucleic acid sequence SEQ ID NO:1, SEQ 
ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:3, 
SEQ ID NO:13, SEQ ID NO:15 or SEQ ID NO:17 or parts of the aforementioned nu- 
cleic acid sequences and which are capable of bringing about the expression, in a cell 

25 or an organism, of a polypeptide with the activity of Clp protease. 

To carry out the hybridization, it is advantageous to use short oligonucleotides with a 
length of approximately 10-50 bp, preferably 15-40 bp, for example of the conserved or 
other regions, which can be determined in the manner with which the skilled worker is 

30 familiar by comparisons with other related genes. However, longer fragments of the 
nucleic acids according to the invention with a length of 100-500 bp, or the complete 
sequences, may also be used for hybridization. Depending on the nucleic 
acid/oligonucleotide used, the length of the fragment or the complete sequence, or de- 
pending on which type of nucleic acid, i.e. DNA or RNA, is being used for the hybridiza- 

35 tion, these standard conditions vary. Thus, for example, the melting temperatures for 
DNA:DNA hybrids are approximately 10°C lower than those of DNA: RNA hybrids of the 
same length. 

Standard hybridization conditions are to be understood as meaning, depending on the 
40 nucleic acid, for example temperatures of between 42 and 58oC in an aqueous buffer 
solution with a concentration of between 0.1 and 5 x SSC (1 X SSC = 0.15 M NaCI, 15 
mM sodium citrate, pH 7.2) or additionally in the presence of 50% formamide, such as, 
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for example, 42°C in 5 x SSC, 50% formarnide. The hybridization conditions for 
DNA:DNA hybrids are advantageously 0.1 x SSC and temperatures of between ap- 
proximately 20°C and 65°C, preferably between approximately 30°C and 45°C. In the 
case of DNA:RNA hybrids, the hybridization conditions are advantageously 0.1 x SSC 
5 and temperatures of between approximately 30 °C and 65 °C, preferably between ap- 
proximately 45°C and 55 °C. These hybridization temperatures which have been stated 
are melting temperature values which have been calculated by way of example for a 
nucleic acid with a length of approx. 100 nucleotides and a G + C content of 50% in the 
absence of formamide. The experimental conditions for DNA hybridization are de- 

10 scribed in relevant textbooks of genetics such as, for example, in Sambrook et al., "Mo- 
lecular Cloning", Cold Spring Harbor Laboratory, 1989, and can be calculated using 
formulae with which the skilled worker is familiar, for example as a function of the 
length of nucleic acids, the type of the hybrids or the G + C content. The skilled worker 
will find further information on hybridization in the following textbooks: Ausubel et al. 

15 (eds), 1985, "Current Protocols in Molecular Biology", John Wiley & Sons, New York; 
Hames and Higgins (eds.), 1985, "Nucleic Acids Hybridization: A Practical Approach", 
IRL Press at Oxford University Press, Oxford; Brown (ed.), 1991, Essential Molecular 
Biology: A Practical Approach, IRL Press at Oxford University Press, Oxford. 

20 Afunctional equivalent of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, 
SEQ ID NO:9, SEQ ID NO:1 1, SEQ ID NO:3, SEQ ID NO:13, SEQ ID NO:15 or SEQ 
ID NO: 17 can be furthermore defined by the degree of homology or identity with SEQ 
ID NO:1, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11, 
SEQ ID NO:3, SEQ ID NO:13, SEQ ID NO:15 or SEQ ID NO:17, respectively, and can 

25 furthermore comprise also natural or artificial mutations of the aforementioned nucleic 
acid sequences which encode a polypeptide with the activity of a nuclear encoded Clp- 
protease. 

The present invention also encompasses, for example, those nucleotide sequences 
30 which are obtained by modification of the SEQ ID NO:1 , SEQ ID NO:3, SEQ ID NO:5, 
SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:3, SEQ ID NO: 13, SEQ ID 
NO:15orSEQIDNO:17. 

For example, such modifications can be generated by techniques with which the skilled 
35 worker is familiar, such as "Site Directed Mutagenesis", "Error Prone PCR", "DNA- 
shuffling" (Nature 370, 1994, pp.389-391) or "Staggered Extension Process" (Nature 
Biotechnol. 16, 1998, pp.258-261). The aim of such a modification can be, for example, 
the insertion of further cleavage sites for restriction enzymes, the removal of DNA in 
order to truncate the sequence, the substitution of nucleotides to optimize the codons, 
40 or the addition of further sequences. Proteins which are encoded via modified nucleic 
acid sequences must retain the desired function despite a deviating nucleic acid se- 
quence. 
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The term "functional equivalents" can also relate to the amino acid sequence encoded 
by the nucleic acid sequence in question. In this case, the term "functional equivalent" 
describes a protein whose amino acid sequence has a defined percentage of identity or 
5 homology with SEQ ID NO:3. 

Functional equivalents thus also comprise naturally occurring variants of the herein- 
described sequences and artificial nucleic acid sequences, for example those which 
have been obtained by chemical synthesis and which are adapted to the codon usage, 
10 and also the amino acid sequences derived from them. 

"Genetic control sequence" describes sequences which have an effect on the transcrip- 
tion and, if appropriate, translation of the nucleic acids according to the invention in 
prokaryotic or eukaryotic organisms. Examples thereof are promoters, terminators or 

15 what are known as "enhancer" sequences. In addition to these control sequences, or 
instead of these sequences, the natural regulation of these sequences may still be pre- 
sent before the actual structural genes and may, if appropriate, have been genetically 
modified in such a way that the natural regulation has been switched off and the ex- 
pression of the target gene has been modified, that is to say increased or reduced. The 

20 choice of the control sequence depends on the host organism or starting organism. 
Genetic control sequences furthermore also comprise the 5'-untranslated region, in- 
trons or the noncoding 3-region of genes. Control sequences are furthermore under- 
stood as meaning those which make possible homologous recombination or insertion 
into the genome of a host organism or which permit removal from the genome. Genetic 

25 control sequences also comprise further promoters, promoter elements or minimal 

promoters, and sequences which have an effect on the chromatin structure (for exam- 
ple matrix attachment regions (MARs)), which can modify the expression-governing 
properties. Thus, genetic control sequences may bring about for example the additional 
dependence of the tissue-specific expression on certain stress factors. Such elements 

30 have been described, for example, for water stress, abscisic acid (Lam E and Chua 
NH, J Biol Chem 1991; 266(26): 17131 -17135), high- and iow-temperature stress 
(Plant Cell 1994, (6): 251-264) and heat stress (Molecular & General Genetics, 1989, 
217(2-3): 246-53). 

35 "Homology" between two nucleic acid sequences or polypeptide sequences is defined 
by the identity of the nucleic acid sequence/poly peptide sequence over in each case 
the entire sequence length, which is calculated by alignment with the aid of the pro- 
gram algorithm GAP according to Needleman and Wunsch 1970, J. Mol. Biol. 48; 443- 
453) setting the following parameters for polypeptides: 



40 



Gap Weight: 8 Length Weight: 2 

Average Match: 2,912 Average Mismatch:-2,003 
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and the following parameters for nucleic acids: 

Gap Weight: 50 Length Weight: 3 

5 Average Match: 10.000 Average Mismatch: 0.000 

In the following text, the term identity is also used synonymously with the term "homol- 
ogy". 

10 . "Mutations" of nucleic or amino acid sequences comprise substitutions, additions, dele- 
tions, inversions or insertions of one or more nucleotide residues, which may also bring 
about changes in the corresponding amino acid sequence of the target protein by sub- 
stitution, insertion or deletion of one or more amino acids, although the functional prop- 
erties of the target proteins are, overall, essentially retained. 

15 

"Natural genetic environment" means the natural chromosomal locus in the organism of 
origin. In the case of a genomic library, the natural genetic environment of the nucleic 
acid sequence is preferably retained at least in part. The environment flanks the nucleic 
acid sequence at least at the 5*- or 3-side and has a sequence length of at least 50 bp, 
20 preferably at least 100 bp, especially preferably at least 500 bp, very especially pref- 
erably at least 1000 bp, and most preferably at least 5000 bp. 

"Plants" for the purposes of the invention are plant cells, plant tissues, plant organs, or 
intact plants, such as seeds, tubers, flowers, pollen, fruits, seedlings, roots, leaves, 
25 stems or other plant parts. Moreover, the term plants is understood as meaning propa- 
gation material such as seeds, fruits, seedlings, slips, tubers, cuttings or root stocks. 

"Recombinant DNA" describes a combination of DNA sequences which can be gener- 
ated by recombinant DNA technology. 

30 

"Recombinant DNA technology": generally known techniques for fusing DNA se- 
quences (for example described in Sambrook et al., 1989, Cold Spring Harbor, NY, 
Cold Spring Harbor Laboratory Press). 

35 "Replication origins" ensure the multiplication of the expression cassettes or vectors 
according to the invention in microorganisms and yeasts, for example the pBR322 ori 
or the P15A ori in E. coli (Sambrook et al.: "Molecular Cloning. A Laboratory Manual", 
2nd ed. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989) and the 
ARS1 ori in yeast (Nucleic Acids Research, 2000, 28(10): 2060-2068). 

40 

"Reporter genes" encode readily quantifiable proteins. The transformation efficacy or 
the expression site or timing can be assessed by means of these genes via growth 
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assay, fluorescence assay, chemoluminescence assay, bioluminescence assay or re- 
sistance assay or via a photometric measurement (intrinsic color) or enzyme activity. 
Very, especially preferred in this context are reporter proteins (Schenbom E, Groskreutz 
D. Mol Biotechnol. 1999; 13(1):29-44) such as the "green fluorescent protein" (GFP) 
5 (Gerdes HH and Kaether C t FEBS Lett. 1996; 389(1):44-47; Chui WL et at., Curr Biol 
1996, 6:325-330; Leffel SM et al., Biotechniques. 23(5):912-8, 1997), chloramphenicol 
acetyl transferase, a luciferase (Giacomin, Plant Sci 1996, 116:59-72; Scikantha, J 
Bact 1996, 178:121; Millar et al., Plant Mol Biol Rep 1992 10:324-414), and luciferase 
genes, in general p-galactosidase or p-glucuronidase (Jefferson et al., EMBO J. 1987, 
10 6, 3901-3907) or the Ura3 gene. 

"Selection markers" confer resistance to antibiotics or other toxic compounds: exam- 
ples which may be mentioned in this context are the neomycin phosphotransferase 
gene, which confers resistance to the aminoglycoside antibiotics neomycin (G 418), 

15 kanamycin, paromycin (Deshayes A et al., EMBO J. 4 (1985) 2731-2737), the sul gene, 
which encodes a mutated dihydropteroate synthase (Guerineau F et al., Plant Mol Biol. 
1990; 15(1): 127-1 36), the hygromycin B phosphotransferase gene (Gen Bank Acces- 
sion NO: K 01 193) and the shble resistance gene, which confers resistance to the 
bleomycin antibiotics such as zeocin. Further examples of selection marker genes are 

20 genes which confer resistance to 2-deoxyglucose-6-phosphate (WO 98/45456) or 

phosphinothricin and the like, or those which confer a resistance to antimetabolites, for 
example the dhfr gene (Reiss, Plant Physiol. (Life Sci. Adv.) 13 (1994) 142-149). Ex- 
amples of other genes which are suitable are trpB or hisD (Hartman SC and Mulligan 
RC, Proc Natl Acad Sci U S A. 85 (1988) 8047-8051). Another suitable gene is the 

25 mannose phosphate isomerase gene (WO 94/20627), the ODC (ornithine decarboxy- 
lase) gene (McConlogue, 1987 in: Current Communications in Molecular Biology, Cold 
Spring Harbor Laboratory, Ed.) or the.Aspergillus terreus deaminase (Tamura K et al., 
Biosci Biotechnol Biochem. 59 (1995) 2336-2338). 

30 "Transformation" describes a process for introducing heterologous DNA into a pro- or 
eukaryotic cell. The term transformed cell describes not only the product of the trans- 
formation process per se, but also all of the transgenic progeny of the transgenic or- 
ganism generated by the transformation. 

35 "Target/target protein": a polypeptide encoded via the nucleic acid sequence according 
to the invention (this term is defined herein below), which may take the form of an en- 
zyme in the traditional sense or, for example, of a structural protein, a protein relevant 
for developmental processes, regulatory protein such as transcription factors, kinases, 
phosphatases, receptors, channel subunits, transport proteins, regulatory subunits 

40 which confer substrate or activity regulation to an enzyme complex. All of the targets or 
sites of action share the characteristic that their functional presence is essential for 
survival or normal development and growth. 
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"Transgenic": referring to a nucleic acid sequence, an expression cassette or a vector 
comprising a nucleic acid sequence according to the invention or an organism trans- 
formed with the abovementioned nucleic acid sequence, expression cassette or vector, 
5 the term transgenic describes all those constructs which have been generated by ge- 
netic engineering methods in which either the nucleic acid sequence of the target pro- 
tein or a genetic control sequence linked operably to the nucleic acid sequence of the 
target protein or a combination of the abovementioned possibilities are not in their natu- 
ral genetic environment or have been modified by recombinant methods. In this con- 
10 text, the modification can be achieved, for example, by mutating one or more nucleo- 
tide residues of the nucleic acid sequence in question. 

Intracellular protein degradation and its regulation are important for several processes 
like recycling of aminoacids, prevention of protein agglomeration and regulation of sig- 
15 naling processes (e.g. signalling of phytohormones). Cytosolic Proteins that are to be 
degraded are ubiquitinylated at the N-terminus and delivered to the proteasome which 
is established as a complex of a large number of protein components in eucaroytes. 
Roughly 12% of the genes in Arabidopsis thaliana are encoding proteins envolved in 
protein degradation by the ubiquitin pathway. 

20 

The stroma of plant chloroplasts contains a unique ubiquitin-independent, ATP- 
dependent protease consisting of two mayor components, a serine-type protease 
(CIpP) and an ATPase (CIpC, -D, -X) both of which are encoded by enzyme families in 
Arabidopsis thaliana (for details on the differing nomenclatures in literature see Adam 

25 et al. 2001, Plant Physiology 125, pp. 191 2-1 8). Six unique CIpP Isoforms (ClpP1-6) are 
nuclear encoded in Arabidopsis and at least one CIpP ist encoded in the plastid ge- 
nome (pCIpP) all of which carry the three conserved active site aminoacids characteris- 
tical for a catalytic triade of serine proteases. Some sequences ofmRNA for putative 
ATP-dependent protease proteolytic subunits CIpP are disclosed in Nakabayashi et al. 

30 ( Plant Cell Physiol 40: 504-514, 1999) and Kotani et al. (DNA Research 4, 291-300, 
1997). A subunit of Clp protease, which does not show any own activbity of a protease 
is disclosed in WO 2003008440 A. Further Clp gene from algae, tobacco or cyanobac- 
terium are depicted in Huang et al. (Mol. Gen. Genet 244, 151-159, 1994), Shikanai et 
al. (Plant Cell Physiol. 42, 261-273, 2001) and Clarke et al. (Plant Molecular Biology 

35 37, 791-801, 1998) respectively. 

Further three nuclear encoded ClpP-lsoforms which miss the conserved amino acid 
residues of the catalytic triade are found in Arabidopsis (ClpR1 , ClpR3, ClpR4). The 
catalytic activity of CIpR-type ClpP-lsoforms has not been shown so far. 

40 At least one CIpP and two CIpX proteins may be targeted to mitochondria in Arabidop- 
sis as deduced from N-terminal signal sequences. CIpP Proteases are conserved in 
bacteria. The CIpP protease in E.coli was formerly known as "protease Ti n . A knock out 
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of the protease Ti was shown to be not lethal. E.coli CIpP is assambled as a complex 
of 14 CIpP subunits in two heptameric rings. Co-immunoprecipitation suggests com- 
plexes of similar sizes and an ATP-dependet interaction of CIpP and CIpC subunits in 
Chloroplasts of Arabidopsis thaliana (Halperin et al. 2001, Planta 213, pp. 614-619). 
5 Furthermore, a 350kDa CIpP complex has been identified in Arabidopsis chloroplasts 
using blue native gel electrophoresis. The complex presumingly containins most of the 
known CIpP Isoenzymes (Benoit-Peltier et al. 2001, Journal of Biological Chemistry 
276, pp. 16348-16327). Consequently the complexity and redundancy of plant Clp pro- 
teases is high and detailed information about composition of the clp complex and the 
10 functional role of its subunits remain to be clarified. Particularly the role of CIpP redun- 
dancy is still unclear. * 

The CIpP subunit is capable of activly hydrolysing peptides of max. 5 aminoacids in 
vitro. CIpA.B.C subunits constitute ATP-hydrolysing chaperones which unfold target- 

15 proteins and present them for hydrolysis to CIpP (Porankiewicz et al. 1999, Molecular 
Microbiology 32, 449-458). Involvement of CIpP in the degradation of the cytochrome 
b6f complex an PSII has been decribed in Chlamydomonas (Majeran et al. 2001, Plant 
Physiology 23+, pp. 421-433). Functional properties of CIpR-type Clp-Proteases as 
well as the CIpP like Proeases are yet to be determined. 

20 Surprisingly, it has been found within the scope of the present invention that plants in 
which a Clp protease was reduced in a selective manner have phenotypes which are 
comparable with phenotypes generated by herbicide application. Drastic growth retar- 
dation and damage such as were observed. 

25 The present invention relates to the use of a polypeptide, which has the activity of nu- 
clear encoded Clp-protease in a method for identifying herbicides, preferably of a poly- 
peptide, which has the activity of nuclear encoded Clp-protease, which is 

a) selected from the group consisting ,of ClpP1 -protease, ClpP2-protease, ClpP3- 
30 protease, ClpP4-protease and ClpP6-protease; or 

b) selected from the group consisting of ClpR1 -protease, ClpR3-protease, Clp-- 
protease; or 

35 c) ClpP-like-protease, wherein more preferably 

a) the ClpP1 -protease is encoded by a nucleic acid sequence which comprises: 

a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:1,or 
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i) 
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ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
. code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:2 by back translating, or 

5 Hi) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 

which has an identity with SEQ ID NO:1 of has at least 50%; or 



m 



iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:1, which is encoded by an amino acid sequence that has at least an 
1 0 identity of 50% with the SEQ ID NO:2; 

b) the ClpP2-protease encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
15 NO:3, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:4 by back translating, or 



20 



iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:3 
which has an identity with SEQ ID NO:3 of has at least 50%; or 



iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
25 NO:3, which is encoded by an amino acid sequence that has at least an 

identity of 50% with the SEQ ID NO:4; 

c) the ClpP3-protease is encoded by a nucleic acid sequence which comprises: 

30 i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 

NO:5, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 

35 NO:6 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:5 
which has an identity with SEQ ID NO:5 of has at least 50%;or 



40 



iv) 



a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:5, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:6; 



55117 



11 

the ClpP4-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:7, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:8 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:7 
which has an identity with SEQ ID NO:7 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:7, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:8; 

the ClpP6-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:9, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:1 0 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:9 
which has an identity with SEQ ID NO:9 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:9, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO: 10; 

the ClpR1 -protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:11,or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO: 12 by back translating, or 
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iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 1 
which has an identity with SEQ ID NO:1 1 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO: 1 1 , which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO: 12; 

the ClpR3-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:13, or * . 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO: 14 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO: 13 
which has an identity with SEQ ID NO: 13 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO: 13, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:14; 

the ClpR4-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:15, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO: 16 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO: 15 
which has an identity with SEQ ID NO:15 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO: 15, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO: 16; 

the CIpP like-protease is encoded by a nucleic acid sequence which comprises: 
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i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:17, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
5 code, can be deduced from the amino acid sequence shown in SEQ ID 

NO: 18 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO: 17 
which has an identity with SEQ ID NO:17 of has at least 50%; or 

10 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO: 17, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO: 18; 

15 wherein the sequences b) i-iv, e) i-iv, f) i-iv and are especially preferred 

The term "comprising" in relation to a nucleic acid sequence means that the nucleic 
acid sequence can be flanked by additional nucleic acid sequences that have on the 5' 
end and on the 3' end or on the 5'end or on the 3' end on the end a sequence length of 
at least 1000 bp, preferably at least 500 bp, more preferably at least 250bp, most pref- 
. 20 erably at least 100bp. 

The functional equivalent according to the invention of SEQ ID NO:1 as described in a) 
iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
25 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:1 .. 

30 The functional equivalents of the nucleic acid sequence SEQ ID NO:1 set forth in a) iv. 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61 %, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 

35 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:2. 

The functional equivalent according to the invention of SEQ ID NO: 3 as described in b) 
iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
40 protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 



'■■ PF'55117 



14 

81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91 %, 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:3. 

An example of a functional equivalent of SEQ ID NO: 3 is the nucleic acid sequence of 
5 Arabidopsis thaliana (Gene Bank Acc. No. AB022327). This sequence is herein incor- 
porated by reference. 

The functional equivalents of the nucleic acid sequence set forth SEQ ID NO:3 in b) iv. 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 

10 Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:4. 
L - 15 • 

The functional equivalent according to the invention of SEQ ID NO:5 as described in c) 
iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 

20 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:5. 

The functional equivalents of the nucleic acid sequence SEQ ID NO:5 set forth in c) iv. 

25 are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 

30 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:6. 

The functional equivalent according to the invention of SEQ ID NO:7 as described in d) 
iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
35 58%, 59%, 60%, 61 %, 62%. 63%. 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91 %, 
92%. 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:7. 

40 The functional equivalents of the nucleic acid sequence set forth SEQ ID NO:7 in d) iv. 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 



i. 



PF'55117 



15 

57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:7. 

5 

The functional equivalent according to the invention of SEQ ID NO:9 as described in e) 
iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
10 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:9. 

0 <: The functional equivalents of the nucleic acid sequence set forth SEQ ID NO:9 in e) iv. 

15 are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61 %, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 

20 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO: 10. 

The functional equivalent according to the invention of SEQ ID NO:1 1 as described in f) 
iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
25 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO: 1 1 . 



30 The functional equivalents of the nucleic acid sequence SEQ ID NO:1 1 set forth in f) iv. 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 

35 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:12. 



40 



An example of a functional equivalent of SEQ ID NO: 1 1 is the nucleic acid sequence 
of Arabidopsis thaliana (Gene Bank Acc. No. AB022330). This sequence is herein in- 
corporated by reference. 



PF 55117 

16 

The functional equivalent according to the invention of SEQ ID NO:13 as described in 

g) iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 

5 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:13. 

The functional equivalents of the nucleic acid sequence SEQ ID NO: 1 3 set forth in g) 
1 0 iv. are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 
15 91 %, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:14. 

The functional equivalent according to the invention of SEQ ID NO: 15 as described in 

h) iii), which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 

20 58%, 59%, 60%, 61 %, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO: 15. 

The functional equivalents of the nucleic acid sequence SEQ ID NO: 15 set forth in h) 
iv. are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%. 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:16. 

The functional equivalent according to the invention of SEQ ID NO: 17 as described in i) 
which encodes a polypeptide, which has the activity of nuclear encoded Clp- 
35 protease, and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 
58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 69%, 
70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 80%, 
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 91%, 
92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO: 17. 

40 

The functional equivalents of the nucleic acid sequence SEQ ID NO: 17 set forth in i) iv. 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
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Clp-protease and has at least an identity of 50%, 51%, 52%, 53%, 54%, 55%, 56%, 
57%, 58%, 59%, 60%, 61%, 62%, 63%, 64% or 65% or preferably of 66%, 67%, 68%, 
69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78% or 79% more preferably of 
80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89% or 90% most preferably of 
91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% with SEQ ID NO:18. 

An example of a functional equivalent of SEQ ID NO: 17 is the nucleic acid sequence 
of Arabidopsis thaliana (Gene Bank Acc. No. AK1 18525). This sequence is herein in- 
corporated by reference. 

Furthermore claimed within the scope of the present invention are plant nucleic acid 
sequence 

I) encoding a ClpP2-protease comprising: 

a) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:3, or 

b) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:4 by backtranslating, or 

c) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 
which has an identity with SEQ ID NO:3 of has at least 66%; or 

d) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:1 1 , which is encoded by an amino acid sequence that has at least an 
identity of 76% with the SEQ ID NO:4; 

II) encoding a ClpR1 -protease comprising: 

a) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:11,or 

b) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO: 12 by backtranslating, or 



c) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 
which has an identity with SEQ ID NO:1 1 of has at least 69%: or 
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d) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:1 1, which is encoded by an amino acid sequence that has at least an 
identity of 71% with the SEQ ID NO: 12; 

5 III) encoding a ClpP-like-protease comprising: 

a) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO: 17, or 

10 b) a nucleic acid sequence which, owing to the degeneracy of the genetic 

code, can be deduced from the amino acid sequence shown in SEQ ID 
NO: 18 by backtranslating, or 

c) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 
15 which has an identity with SEQ ID NO: 17 of has at least 67%; or 

d) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO: 17, which is encoded by an amino acid sequence that has at least an 
identity of 79% with the SEQ ID NO: 18; 

20 The functional equivalent of SEQ ID NO: 3 set forth in I c) has at least an identity of 
66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, by preference at least 75%, 76%, 
77%, 78%, 79%, 80%, 81%, 82% or 83%, preferably at least 84%, 85%, 86%, 87%, 
88%, 89%, 90%, 91 %, 92% or 93%, especially preferably at least 94%, 95%, 96%, 
97%, 98% or 99% with SEQ ID NO:3. 

25 

The functional equivalents of the nucleic acid sequence SEQ ID NO:3 set forth in I) d) 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
(||| Clp-protease and has at least an identity of 77%, by preference at least 78%, 79%, 

80%, 81%, 82% or 83%, preferably at least 84%, 85%, 86%, 87%, 88%, 89%, 90%, 
30 91%, 92%, 93%, especially preferably at least 94%, 95%, 96%, 97%, 98%, 99% with 
SEQ ID NO:4. 

The functional equivalent of SEQ ID NO:1 1 set forth in II c) has at least an identity of 
69%, 70%, 71%, 72%, 73% or 74%, by preference at least 75%, 76%, 77%, 78%, 79%, 
35 80%, 81%, 82% or 83%, preferably at least 84%, 85%, 86%, 87%, 88%, 89%, 90%, 
91%, 92% or 93%, especially preferably at least 94%, 95%, 96%, 97%, 98% or 99% 
withSEQIDNO:11. 

The functional equivalents of the nucleic acid sequence SEQ ID NO:1 1 set forth in II) d) 
40 are encoded by an amino acid sequence, which has the activity of nuclear encoded 

Clp-protease and has at least an identity of 71% by preference at least 72%, 73%,74%, 
75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, preferably at least 84%, 85%, 86%, 
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87%, 88%, 89%, 90%, 91%, 92%, 93%, especially preferably at least 94%, 95%, 96%, 
97%, 98%, 99% with SEQ ID NO:12. 

The functional equivalent of SEQ ID NO: 17 set forth in I c) has at least an identity of 
5 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, by preference at least 75%, 76%, 77%, - 
78%, 79%, 80%, 81%, 82% or 83%, preferably at least 84%, 85%, 86%, 87%, 88%, 
89%, 90%, 91%, 92% or 93%, especially preferably at least 94%, 95%, 96%, 97%, 
98% or 99% with SEQ ID NO: 17. 

10 The functional equivalents of the nucleic acid sequence SEQ ID NO: 17 set forth in I) d) 
are encoded by an amino acid sequence, which has the activity of nuclear encoded 
Clp-protease and has at least an identity of 79%, by preference at least 79%, 80%, 
81%, 82% or 83%, preferably at least 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 
92%, 93%, especially preferably at least 94%, 95%, 96%, 97%, 98%, 99% with SEQ ID 

15 NO:18. 

The polypeptides encoded by the abovementioned nucleic acid sequences according 
to I c)-d), II c)-d) and III c)-d) are likewise claimed. The functional equivalents as de- 
scribed in c) and d) are distinguished by the same functionality, i.e. they have the activ- 
20 ity of a clp-protease. 

The nucleic acid sequences I c)-d), II c)-d) and III c)-d) are hereinbelow termed NCLP- 
sequences. 

25 The term "nucleic acid sequences according to the invention" which is used hereinbe- 
low refers to nucleic acid sequences encoding a polypeptide, which has the activity of 
nuclear encoded Clp-protease in a method for identifying herbicides, preferably of a 
polypeptide, which has the activity of nuclear encoded Clp-protease, which is 

30 a) selected from the group consisting of ClpP1 -protease, ClpP2-protease, ClpP3- 
protease, ClpP4-protease and ClpP6-protease; or 

b) selected from the group consisting of ClpR1 -protease, ClpR3-protease, ClpR4- 
protease; or 

35 

c) ClpP-like-protease, wherein more preferably 

a) the ClpP1 -protease is encoded by a nucleic acid sequence which comprises: 

40 i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 

NO:1, or 
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ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:2 by back translating, or 

5 iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 

which has an identity with SEQ ID NO:1 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:1, which is encoded by an amino acid sequence that has at least an 
10 identity of 50% with the SEQ ID NO:2; 

b) the ClpP2-protease encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
15 NO:3, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:4 by back translating, or 

20 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:3 
which has an identity with SEQ ID NO:3 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
25 NO:3, which is encoded by an amino acid sequence that has at least an 

identity of 50% with the SEQ ID NO:4; 

c) the ClpP3-protease is encoded by a nucleic acid sequence which comprises: 

30 i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 

NO:5, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 

35 NO:6 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:5 
which has an identity with SEQ ID NO:5 of has at least 50%;or 

40 iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 

NO: 5, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:6; 
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the ClpP4-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:7, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:8 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:7 
which has an identity with SEQ ID NO:7 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:7, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:8; 

the ClpP6-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:9, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO:1 0 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:9 
which has an identity with SEQ ID NO:9 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 
NO:9, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO: 10; 

the ClpR1 -protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:11, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO: 12 by back translating, or 
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iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO:1 1 
which has an identity with SEQ ID NO:1 1 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 

5 NO:1 1 , which is encoded by an amino acid sequence that has at least an 

identity of 50% with the SEQ ID NO:1 2; 

g) the ClpR3-protease is encoded by a nucleic acid sequence which comprises: 

10 i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 

NO: 13, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 

15 NO: 14 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO: 13 
which has an identity with SEQ ID NO:13 of has at least 50%; or 

20 iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 

NO: 13, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO:14; 



25 



35 



h) the ClpR4-protease is encoded by a nucleic acid sequence which comprises: 

i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:15, or 



ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
30 code, can be deduced from the amino acid sequence shown in SEQ ID 

NO: 16 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO: 15 
which has an identity with SEQ ID NO: 15 of has at least 50%; or 



iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 

NO: 15, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO: 16; 



40 i) the CIpP like-protease is encoded by a nucleic acid sequence which comprises: 
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i) a nucleic acid sequence with the nucleic acid sequence shown in SEQ ID 
NO:17, or 

ii) a nucleic acid sequence which, owing to the degeneracy of the genetic 
code, can be deduced from the amino acid sequence shown in SEQ ID 
NO: 18 by back translating, or 

iii) a functional equivalent of nucleic acid sequence shown in SEQ ID NO: 17 
which has an identity with SEQ ID NO: 17 of has at least 50%; or 

iv) a functional equivalent of the nucleic acid sequence shown in SEQ ID 

. NO: 17, which is encoded by an amino acid sequence that has at least an 
identity of 50% with the SEQ ID NO: 18; 

15 wherein the sequences b) i-iv, e) i-iv, f) i-iv and are especially preferred 

A polypeptide, which has the activity of nuclear encoded CIp-protease and is encoded 
by a nucleic acid sequence according to the invention are hereinbelow simply referred 
to as "CLP". 



10 



20 



Reduced amounts of glyoxysomal CLP cause growth retardation and necrotic and 
chlorotic leaves in plants. 



The gene products of the nucleic acids according to the invention constitute novel tar- 
25 gets for herbicides, which make possible the provision of novel herbicides for control- 
ling undesired plants. Moreover, the gene products of the nucleic acids according to 
the invention constitute novel targets for growth regulators which make possible the 
provision of novel growth regulators for regulating the growth of plants. 

30 Undesired plants are understood as meaning, in the broadest sense, all those plants 
which grow at locations where they are undesired, for example: 

Dicotyledonous weeds of the genera: Sinapis, Lepidium, Galium, Stellaria, Matricaria, 
Anthemis, Galinsoga, Chenopodium, Urtica, Senecio, Amaranthus, Portulaca, Xan- 
35 thium, Convolvulus, Ipomoea, Polygonum, Sesbania, Ambrosia, Cirsium, Carduus, 
Sonchus, Solanum, Rorippa, Rotala, Lindernia, Lamium, Veronica, Abutilon, Emex, 
Datura, Viola, Galeopsis, Papaver, Centaurea, Trifolium, Ranunculus, Taraxacum. 

Monocotyledonous weeds from the genera: Echinochloa, Setaria, Panicum, Digitaria, 
40 Phleum, Poa, Festuca, Eleusine, Brachiaria, Lolium, Bromus, Avena, Cyperus, Sor- 
ghum, Agropyron, Cynodon, Monochoria, Fimbristylis, Sagittaria, Eleocharis, Scirpus, 
Paspalum, Ischaemum, Sphenoclea, Dactyloctenium, Agrostis, Alopecurus, Apera. 
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SEQ IDNO:1; 3, 5, 7, 9, 11, 13, 15, 17 or 19-21 or parts of SEQ ID NO: 1; 3, 5, 7, 9, 
11, 13, 15, 17 or 19-21 can be used for the preparation of hybridization probes. The 
preparation of these probes and the experimental procedure is known. For example, 
5 this can be effected via the selective preparation of radioactive or nonradioactive 

probes by PCR and the use of suitably labeled oligonucleotides, followed by hybridiza- 
tion experiments. The technologies required for this purpose are detailed, for example, 
in T. Maniatis, E.F. Fritsch and J. Sambrook, "Molecular Cloning: A Laboratory Man- 
ual", Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989). The probes in 
10 question can furthermore be modified by standard technologies (Lit. SDM or random 
mutagenesis) in such a way that they can be employed for further purposes, for exam- 
ple as a probe which hybridizes specifically with mRNA and the corresponding coding 
sequences in order to analyze the corresponding sequences in other organisms. 

V 

15 The abovementioned probes can be used for the detection and isolation of functional 
equivalents of SEQ ID NO:2, 4, 6, 8, 10, 12, 14, 16 or 18 from other plant species on 
the basis of sequence identities; In this context, part or all of the sequence of the SEQ 
ID NO:2 in question is used as a probe for screening a genomic or cDNA library of the 
plant species in question or in a computer search for sequences of functional equiva- 

20 lents in electronic databases. 

Preferred plant species are the undesired plants which have already been mentioned 
at the outset. 

25 The invention furthermore relates to expression cassettes comprising 

a) genetic control sequences in operable linkage with a NCLP sequence; or 

b) additional functional elements, or 

30 

c) a combination of a) and b); 

and to the use of expression cassettes comprising 

35 a) genetic control sequences in operable linkage with a nucleic acid sequence ac- 
cording to the invention, 

b) additional functional elements, or 



40 



c) a combination of a) and b); 
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for expressing a CLP, which can be used in in vitro assay systems. Both embodiments 
of the above-described expression cassettes are referred in the following text as ex- 
pression cassette according to the invention. 

5 In a preferred embodiment, an expression cassette according to the invention com- 
prises a promoter at the 5' end of the coding sequence and, at the 3' end, a transcrip- 
tion termination signal and, if appropriate, further genetic control sequences which are 
linked operably with the interposed nucleic acid sequence according to the invention. 

10 The expression cassettes according to the invention are also understood as meaning 
analogs which can be brought about, for example, by a combination of the individual 
nucleic acid sequences on a polynucleotide (multiple constructs), on a plurality of 
polynucleotides in a cell (cotransformation) or by sequential transformation. 

1 5 Advantageous genetic control sequences under point a) for the expression cassettes 
according to the invention or for vectors comprising expression cassettes according to 
the invention are, for example, promoters such as the cos, tac, trp, tet, Ipp, lac, laclq, 
T7, T5, T3, gal, trc, ara, SP6, d-PR or the D-PL promoter, all of which can be used for 
expressing a CLP, in Gram-negative bacterial strains. 

20 

Examples of further advantageous genetic control sequences are present, for example, 
in the promoters amy and SP02, both of which can be used for expressing a CLP, in 
Gram-positive bacterial strains, and in the yeast or fungal promoters AUG1, GPD-1, 
PX6, TEF, CUP1, PGK, GAP1, TPI, PH05, AOX1, GAL10/CYC1, CYC1, OliC, ADH, 

25 TDH, Kex2, MFA or NMT or combinations of the abovementioned promoters (Degryse 
et al., Yeast 1995 June 15; 1 1(7):629-40; Romanos et al. Yeast 1992 June;8(6):423- 
88; Benito et al. Eur. J. Plant Pathol. 104, 207-220 (1998); Cregg et al. Biotechnology 
(N Y) 1993 Aug;11(8):905-10; Luo X., Gene 1995 Sep 22;163(1):127-31: Nacken et al., 
Gene 1996 Oct 10;175(1-2): 253-60; Turgeon et al., Mol Cell Biol 1987 Sep;7(9):3297- 

30 305) or the transcription terminators NMT, Gcy1 , TrpC, AOX1 , nos, PGK or CYC1 (De- 
gryse et al., Yeast 1995 June 15; 11(7):629-40; Brunelli et al. Yeast 1993 (Dec9(12): 
1309-18; Frisch et al., Plant Mol. Biol. 27(2), 405-409 (1995); Scorer et al., Biotechnol- 
ogy (N.Y. 12 (2), 181-184 (1994), Genbank acc. number Z46232; Zhao et al. Genbank 
acc number : AF049064; Punt et al., (1987) Gene 56 (1), 1 17-124), all of which can be 

35 used for expressing CLP, in yeast strains. 

Examples of genetic control sequences which are suitable for expression in insect cells 
are the polyhedrin promoter and the p10 promoter (Luckow, V.A. and Summers, M.D. 
(1988) Bio/Techn. 6, 47-55). 

40 

Advantageous genetic control sequences for expressing CLP, in cell culture, in addition 
to polyadenylation sequences such as, for example, from simian virus 40, are eu- 
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karyotic promoters of viral origin such as, for example, promoters of the polyoma virus, 
adenovirus 2, cytomegalovirus or simian virus 40. 

Further advantageous genetic control sequences for expressing nuclear encoded Clp 
5 Protease, in plants are present in the plant promoters CaMV/35S [Franck et al., Cell 
21(1980) 285-294], PRP1 [Ward etal., Plant Mol. Biol. 22 (1993)], SSU, OCS, LEB4, 
USP, STLS1, B33, NOS; FBPaseP (WO 98/18940) or in the ubiquitin or phaseolin 
promoter; a promoter which is preferably used being, in particular, a plant promoter or 
a promoter derived from a plant virus. Especially preferred are promoters of viral origin 

10 such as the promoter of the cauliflower mosaic virus 35S transcript (Franck et al., Cell 
21 (1980), 285-294; Odell et al., Nature 313 (1985), 810-812). Further preferred consti- 
tutive promoters are, for example, the agrobacterium nopaline synthase promoter, the 
TR double promoter, the agrobacterium OCS (octopine synthase) promoter, the ubiq- 
uitin promoter, (Holtorf S et al., Plant Mol Biol 1 995, 29:637-649), the promoters of the 

1 5 vacuolar ATPase subunits, or the promoter of a proline-rich wheat protein (WO 
91/13991). 

The expression cassettes may also comprise, as genetic control sequence, a chemi- 
cally inducible promoter, by which the expression of the exogenous gene in the plant 

20 can be controlled at a specific point in time. Such promoters, such as, for example, the 
PRP1 promoter (Ward et al., Plant. Mol. Biol. 22 (1993), 361-366), a salicylic-acid- 
inducible promoter (WO 95/19443), a benzenesulfonamide-inducible promoter (EP-A- 
0388186), a tetracyclin-inducible promoter (Gatz et al., (1992) Plant J. 2, 397404), an 
abscisic-acid-inducible promoter (EP-A 335528) or an ethanol- or cyclohexanone- 

25 inducible promoter (WO 93/21334) may also be used. 

Furthermore, suitable promoters are those which confer tissue- or organ-specific ex- 
pression in, for example, anthers, ovaries, flowers and floral organs, leaves, stomata, 
trichomes, stems, vascular tissues, roots and seeds. Others which are suitable in addk 

30 tion to the abovementioned constitutive promoters are, in particular, those promoters 
which ensure leaf-specific expression. Promoters which must be mentioned are the 
potato cytosolic FBPase promoter (WO 97/05900), the rubisco (ribulose-1,5- 
bisphosphate carboxylase) SSU (small subunit) promoter or the ST-LSI promoter from 
potato (Stockhaus et al., EMBO J. 8 (1989), 2445 - 245). Promoters which are further- 

35 more preferred are those which control expression in seeds and plant embryos. Exam- 
ples of seed-specific promoters are the phaseolin promoter (US 5,504,200, Bustos MM 
et al., Plant Cell. 1989;1(9):839-53), the promoter of the 2S albumin gene (Joseffson 
LG et al., J Biol Chem 1987, 262:12196-12201), the legumin promoter (Shirsat A et al., 
Mol Gen Genet, 1989;2T5(2):326-331), the USP (unknown seed protein) promoter 

40 (Baumlein H et al., Molecular & General Genetics 1991, 225(3):459-67), the napin 
gene promoter (Stalberg K, et al., L Planta 1996, 199:515-519), the sucrose binding 
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protein promoter (WO 00/26388) or the LeB4 promoter (Baumlein H et al., Mol Gen 
Genet 1991, 225: 121-128; Fiedler, U. et al., Biotechnology (NY) (1995), 13 (10) 1090). 

Further promoters which are suitable as genetic control sequences are, for example, 
5 specific promoters for tubers, storage roots or roots, such as, for example, the class I 
patatin promoter (B33), the potato cathepsin D inhibitor promoter, the starch synthase 
(GBSS1) promoter or the sporamin promoter, fruit-specific promoters such as, for ex- 
ample, the fruit-specific promoter from tomato (EP-A 409625), fruit-maturation-specific 
promoters such as, for example, the fruit-maturation-specific promoter from tomato 

10 (WO 94/21794), inflorescence-specific promoters such as, for example, the phytoene 
synthase promoter (WO 92/16635) or the promoter of the P-rr gene (WO 98/22593), or 
plastid- or chromoplast-specific promoters such as, for example, the RNA polymerase 
promoter (WO 97/06250), or else the Glycine max phosphoribosyl-pyrophosphate ami- 
dotransferase promoter (see also Genbank Accession No. U87999), or another node- 

15 specific promoter as described in EP-A 249676. 

Additional functional elements b) are understood as meaning, by way of example but 
not by limitation, reporter genes, replication origins, selection markers and what are 
known as affinity tags, in fusion with CLP, directly or by means of a linker optionally 
20 comprising a protease cleavage site. Further suitable additional functional elements are 
sequences which ensure that the product is targeted into the apoplasts, into plastids, 
the vacuole, the mitochondrion, the peroxisome, the endoplasmatic reticulum (ER) or, 
owing to the absence of such operative sequences, remains in the compartment where 
it is formed, the cytosol, (Kermode, Crit Rev. Plant Sci. 15, 4 (1996), 285-423). 

25 

Also in accordance with the invention are vectors comprising at least one copy of the 
nucleic acid sequences according to the invention and/or the expression cassettes ac- 
cording to the invention. 

30 In addition to plasmids, vectors are furthermore also understood as meaning all of the 
other known vectors with which the skilled worker is familiar, such as, for example, 
phages, viruses such as SV40, CMV, baculovirus, adenovirus, transposons, IS ele- 
ments, phasmids, phagemids, cosmids or Nnear or circular DNA. These vectors can be 
replicated autonomously in the host organism or replicated chromosomally; chromo- 

35 somal replication is preferred. 

In a further embodiment of the vector, the nucleic acid construct according to the inven- 
tion can advantageously also be introduced into the organisms in the form of a linear 
DNA and integrated into the genome of the host organism via heterologous or homolo- 
40 gous recombination. This linear DNA may consist of a linearized plasmid or only of the 
nucleic acid construct as vector, or the nucleic acid sequences used. 



L 
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Further prokaryotic or eukaryotic expression systems are mentioned in Chapters 16 
and 17 in Sambrook et al., "Molecular Cloning: A Laboratory Manual." 2nd ed., Cold 
Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY, 1989. Further advantageous vectors are described in Hellens et al. (Trends in 
5 plant science, 5, 2000). 

The expression cassette according to the invention and vectors derived therefrom can 
be used for transforming bacteria, cyanobacteria, (for example of the genus Synecho- 
cystis, Anabaena, Calothrix, Scytonema, Oscillatoria, Plectonema and Nostoc), proteo- 
10 bacteria such as, for example, Magnetococcus sp. MC1, yeasts, filamentous fungi and 
algae and eukaryoatic nonhuman cells' (for example insect cells) with the aim of pro- 
ducing CLP, recombinantly, the generation of a suitable expression cassette depending 
on the organism in which the gene is to be expressed. 

15 Vectors comprising a NCLP sequence form part of the subject-matter of the present 
invention. 

In a further advantageous embodiment, the nucleic acid sequences according to the 
invention may also be introduced into an organism by themselves. 

20 

If, in addition to the nucleic acid sequences, further genes are to be introduced into the 
organism, they can all be introduced into the organism together in a single vector, or 
each individual gene can be introduced into the organism in each case in one vector, it 
being possible to introduce the different vectors simultaneously or in succession. 

25 

In this context, the introduction, into the organisms in question (transformation), of the 
nucleic acid(s) according to the invention, of the expression cassette or of the vector 
can be effected in principle by all methods with which the skilled worker is familiar. 

30 In the case of microorganisms, the skilled worker will find suitable methods in the text- 
books by Sambrook, J. et al. (1989) "Molecular cloning: A laboratory manual", Cold 
Spring Harbor Laboratory Press, von F.M. Ausubel et al. (1994) "Current protocols in 
molecular biology", John Wiley and Sons, by D.M. Glover et al., DNA Cloning Vol.1, 
(1995), IRL Press (ISBN 019-963476-9), by Kaiser et al. (1994). Methods in Yeast Ge- 

35 netics, Cold Spring Habor Laboratory Press or Guthrie et al. "Guide to Yeast Genetics 
and Molecular Biology", Methods in Enzymology, 1994, Academic Press. In the trans- 
formation of filamentous fungi, the methods of choice are firstly the generation of pro- 
toplasts and transformation with the aid of PEG (Wiebe et al. (1997) Mycol. Res. 101 
(7): 971-877; Proctor et al. (1997) Microbiol. 143, 2538-2591), and secondly the trans- 

40 formation with the aid of Agrobacterium tumefaciens (de Groot et al. (1998) Nat. Bio- 
tech. 16, 839-842). 
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In the case of dicots, the methods which have been described for the transformation 
and regeneration of plants from plant tissues or plant cells can be exploited for tran- 
sient or stable transformation. Suitable methods are the biolistic method or the trans- 
formation of protoplasts (cf., for example, Willmitzer, L, 1993 Transgenic plants. In: 
5 Biotechnology, A Multi-Volume Comprehensive Treatise (H.J. Rehm, G. Reed, A. 
Puhler, P. Stadler, eds.) t Vol. 2, 627-659, VCH Weinheim-New York-Basle- 
Cambridge), electroporation, the incubation of dry embryos in DNA-containing solution, 
microinjection and the agrobacterium-radiated gene transfer. The abovementioned 
■ methods are described, for example, in B. Jenes et al., Techniques for Gene Transfer, 
10 in: Transgenic Plants, Vol. 1, Engineering and Utilization, edited by S.D. Kung and R. 
Wu, Academic Press (1993) 128-143 and in Potrykus, Annu. Rev. Plant Physiol. Plant 
Molec.Biol. 42 (1991) 205-225). 

The transformation by means of agrobacteria, and the vectors to be used for the trans- 
15 formation, are known to the skilled worker and described extensively in the literature 
(Bevan et al., Nucl. Acids Res. 12 (1984) 871 1 . The intermediary vectors can be inte- 
grated into the agrobacterial Ti or Ri plasmid by means of homologous recombination 
owing to sequences which are homologous to sequences in the T-DNA. This plasmid 
additionally contains the vir region, which is required for the transfer of the T-DNA. In- 
20 termediary vectors are not capable of replication in agrobacteria. The intermediary vec- 
tor can be transferred to Agrobacterium tumefaciens by means of a helper plasmid 
(conjugation). Binary vectors are capable of replication both in E. coli and in agrobacte- 
ria. They contain a selection marker gene and a linker or polylinker which are framed 
by the right and left T-DNA border region. They can be transformed directly into the. 
25 agrobacteria (Holsters et al. Mol. Gen. Genet. 163 (1978), 181-187), EP A 0 120 516; 
Hoekema, in: The Binary Plant Vector System Offsetdrukkerij Kanters B.V., Alblasser- 
dam (1985), Chapter V; Fraley et al., Crit. Rev. Plant. Sci., 4: 1-46 and An et al. EMBO 
J. 4(1985), 277-287). 

30 The transformation of monocots by means of vectors based on agrobacterium has also 
been described (Chan et al., Plant Mol. Biol. 22(1993), 491-506; Hiei et al., Plant J. 6 
(1994) 271-282; Deng et al. Science in China 33 (1990), 28-34; Wilmink et al., Plant 
Cell Reports 11,(1992) 76-80; May et al. Biotechnology 13 (1995) 486-492; Conner and 
Domisse; Int. J. Plant Sci. 153 (1992) 550-555; Ritchie et al. Transgenic Res. (1993) 

35 252-265). Alternative systems for the transformation of monocots are the transforma- 
tion by means of biolistic approach (Wan and Lemaux; Plant Physiol. 104 (1994), 37- 
48; Vasil et al. Biotechnology 1 1 (1992), 667-674; Ritala et al., Plant Mol. Biol 24, 
(1994) 317-325; Spencer et al., Theor. Appl. Genet. 79 (1990), 625-631), protoplast 
transformation, the electroporation of partially permeabilized cells, and the introduction 

40 of DNA by means of glass fibers. In particular the transformation of maize has been 
described repeatedly in the literature (cf., for example, WO 95/06128; EP 0513849 A1; 
EP 0465875 A1 ; EP 0292435 A1 ; Fromm et al., Biotechnology 8 (1990), 833-844; 



PF 55117 



30 

Gordon-Kamm et al., Plant Cell 2 (1990), 603-618; Koziel et al., Biotechnology 
11(1993) 194-200; Moroc et al., Theor Applied Genetics 80 (190) 721-726). 

The successful transformation of other cereal species has also already been described 
5 for example in the case of barley (Wan and Lemaux, see above; Ritala et al., see 
above; wheat (Nehra et al., Plant J. 5(1994) 285-297). . . 

Agrobacteria which have been transformed with a vector according to the invention can 
likewise be used in a known manner for the transformation of plants, such as test 

10 plants like Arabidopsis or crop plants like cereals, maize, oats, rye, barley, wheat, soya, 
rice, cotton, sugarbeet, canola, sunflower, flax, hemp, potato, tobacco, tomato, carrot, 
capsicum, oilseed rape, tapioca, cassava, arrowroot, Tagetes, alfalfa, lettuce and the 
various tree, nut and grapevine species, for example by bathing scarified leaves or leaf 
segments in an agrobacterial solution and subsequently growing them in suitable me- 

15 dia. 

The genetically modified plant cells can be regenerated via all methods with which the 
skilled worker is familiar. Such methods can be found in the abovementioned publica- 
tions by S.D. Kung and R. Wu, Potrykus or Hofgen and Willmitzer. 

20 

The transgenic organisms generated by transformation with one of the above- 
described embodiments of an expression cassette comprising a nucleic acid sequence 
according to the invention or a vector comprising the abovementioned expression cas- 
sette, and the recombinant CLP, which can be obtained from the transgenic organism 
25 by means of expression, form part of the subject matter of the present invention. The 
use of transgenic organisms comprising an expression cassette according to the inven- 
tion, for example for providing recombinant protein, and/or the use of these organisms 
in in-vivo assay systems likewise form part of the subject matter of the present inven- 
tion. 

30 

Preferred organisms for the recombinant expression are not only bacteria, yeasts, 
mosses, algae and fungi, but also eukaryotic cell lines. 

Preferred mosses are Physcomitrella patens or other mosses described in Kryptoga- 
35 men [Cryptogamia], Vol.2, Moose, Fame [Mosses, Ferns], 1991, Springer Verlag (ISBN 
3540536515). 

Preferred within the bacteria are, for example, bacteria from the genus Escherichia, 
Erwinia, Flavobacterium, Alcaligenes or cyanobacteria, for example from the genus 
40 Synechocystis, Anabaena, Calothrix, Scytonema, Oscillatoria, Plectonema and Nostoc, 
especially preferably Synechocystis or Anabaena. 
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Preferred yeasts are Candida, Saccharomyces, Schizosaccheromyces, Hansenula or 
Pichia. 

Preferred fungi are Aspergillus, Trichoderma, Ashbya, Neurospora, Fusarium, Beauve- 
5 ria, Mortierella, Saprolegnia, Pythium, or other fungi described in Indian Chem Engr. 
Section B. Vol 37, No 1,2 (1995). 

Preferred plants are selected in particular among monocotyledonous crop plants such 
as, for example, cereal species such as wheat, barley, sorghum or millet, rye, triticale, 

10 maize, rice or oats, and sugarcane. The transgenic plants according to the invention 
are, furthermore, in particular selectedfrom among dicotyledonous crop plants such as, 
for example, Brassicaceae such as oilseed rape, cress, Arabidopsis, cabbages or ca- 
nola; Leguminosae such as soyabean, alfalfa, pea, beans or peanut, Solanaceae such 
as potato, tobacco, tomato, egg plant or capsicum; Asteraceae such as sunflower, 

15 Tagetes, lettuce or Calendula; Cucurbitaceae such as melon, pumpkin/squash or zuc- 
chini, or linseed, cotton, hemp, flax, red pepper, carrot, sugar beet, or various tree, nut 
and grapevine species. 

In principle, transgenic animals such as, for example, C. elegans, are also suitable as 
20 host organisms. 

Also preferred is the use of expression systems and vectors which are available to the 
public or commercially available. 

25 Those which must be mentioned for use in E. coli bacteria are the typical advanta- 
geous commercially available fusion and expression vectors pGEX [Pharmacia Biotech 
Inc; Smith, D.B. and Johnson, K.S. (1988) Gene 67:31-40], pMAL (New England Bio- 
labs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ), which contains glutathione 
S transferase (GST), maltose binding protein or protein A, the pTrc vectors (Amann et 

30 al., (1988) Gene 69:301-315), u pKK233-2" from CLONTECH, Palo Alto, CA and the 
"pET, and the "pBAD" vector series from Stratagene, La Jolla and the TOPO-TA vec- 
tor series drom Invitrogen. 

Further advantageous vectors for use in yeast are pYepSed (Baldari, et al., (1987) 
35 Embo J. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 30:933-943), pJRY88 
(Schultz et al., (1987) Gene 54:1 13-123), and pYES derivatives, pGAPZ derivatives, 
pPICZ derivatives, and the vectors of the "Pichia Expression Kit" (Invitrogen Corpora- 
tion, San Diego, CA). Vectors for use in filamentous fungi are described in: van den 
Hondel, C.A.M.J.J. & Punt, P.J. (1991) "Gene transfer systems and vector develop- 
40 ment for filamentous fungi, in: Applied Molecular Genetics of Fungi, J.F. Peberdy, et 
al., eds., p. 1-28, Cambridge University Press: Cambridge. 
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As an alternative, insect cell expression vectors may also be used advantageously, for 
example for expression in Sf9, Sf21 or Hi5 cells, which are infected via recombinant 
Baculoviruses. Examples of these are the vectors of the pAc series (Smith et al. (1983) 
Mol. Cell Biol. 3:2156-2165) and the pVL series (Lucklow and Summers (1989) Virol- 
5 ogy 170:31-39). Others which may be mentioned are the Baculovirus expression sys- * 
terns "MaxBac 2.0 Kit" and "Insect Select System" from Invitrogen, Carlsbad or 
"BacPAK Baculovirus Expression System" from CLONTECH, Palo Alto, CA. Insect 
cells are particularly suitable for overexpressing eukaryotic proteins since they effect 
posttranslational modifications of the proteins which are not possible in bacteria and 
10 yeasts. The skilled worker is familiar with the handling of cultured insect cells and with 
their infection for expressing proteins, which can be carried out analogously to known 
methods (Luckow and Summers, Bio/Tech. 6, 1988, pp.47-55; Glover and Hames 
(eds) in DNA Cloning 2, A practical Approach, Expression Systems, Second Edition, 
Oxford University Press, 1 995, 205-244). 

15 

Plant cells or algal cells are others which can be used advantageously for expressing 
genes. Examples of plant expression vectors can be found as mentioned above in 
Becker, D., et al. (1992) "New plant binary vectors with selectable markers located 
proximal to the left border", Plant Mol. Biol. 20: 1195-1197 or in Bevan, M.W. (1984) 
20 "Binary Agrobacterium vectors for plant transformation", Nucl. Acid. Res. 12: 871 1- 
8721. 

Moreover, the nucleic acid sequences according to the invention can be expressed in 
mammalian cells. Examples of suitable expression vectors are pCDM8 and pMT2PC, 

25 which are mentioned in: Seed, B. (1987) Nature 329:840 or Kaufman et al. (1987) 

EMBO J. 6:187-195). Promoters preferably to be used in this context are of viral origin 
such as, for example, promoters of polyoma virus, adenovirus 2, cytomegalovirus or 
simian virus 40. Further prokaryotic and eukaryotic expression systems are mentioned 
in Chapter 16 and 17 in Sambrook et al., Molecular Cloning: A Laboratory Manual. 2nd 

30 ed., Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, NY, 1989. Further advantageous vectors are described in Hellens et al. 
(Trends in plant science, 5, 2000). 

The transgenic organisms which comprise a NCLP sequence are claimed within the 
35 scope of the present invention. 

All of the above-described embodiments of the transgenic organisms, which comprise 
at least one nucleic acid sequence according to the invention come under the term 
"transgenic organism according to the invention". 

40 

The present invention furthermore relates to the use of CLP, in a method for identifying 
herbicidally active test compounds. 
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The method according to the invention for identifying herbicidally active compounds 
preferably comprises the following steps: 

i. bringing CLP into contact with one or more test compounds under conditions 
which permit the test compound(s) to bind to a nucleic acid sequence according 
to the invention or to CLP, and 

ii. detecting whether the test compound binds to the CLP of i), or 

iii. detecting whether the test compound reduces or blocks the enzymatic or biologi- 
cal activity of CLP of i), or 



/r: : iv. detecting whether the test compound reduces or blocks the transcription, transla- 

te. 

15 tion or expression of CLP of i). 

The detection in accordance with step (ii) of the above method can be effected using 
techniques which identify the interaction between the polypeptide and figand. In this 
context, either the test compound or the enzyme can contain a detectable label such 
20 as, for example, a fluorescent label, a radioisotope, a chemiluminescent label or an 
enzyme label. Examples of enzyme labels are horseradish peroxidase, alkaline phos- 
phatase or luciferase. The subsequent detection depends on the label and is known to 
the skilled worker. 

25 In this context, five preferred embodiments which are also suitable for high-throughput 
methods (HTS) in connection with the present invention must be mentioned in particu- 
lar: 

1 . The average diffusion rate of a fluorescent molecule as a function of the mass 
30 can be determined in a small sample volume via fluorescence correlation spec- 

troscopy (FCS) (Proc. Natl. Acad. Sci. USA (1994) 1 1753-1 1575). FCS can be 
employed for determining protein/ligand interactions by measuring the change in 
the mass, or the changed diffusion rate which this entails, of a test compound 
when binding to CLP. A method according to the invention can be designed di- 
35 rectly for measuring the binding of a test compound labeled by a fluorescent 

molecule. As an alternative, the method according to the invention can be de- 
signed in such a way that a chemical reference compound which is labeled by a 
fluorescent molecule is displaced by further test compounds ("displacement as- 
say"). 



40 



Fluoresence polarization exploits the characteristic of a quiescent fluorophore 
excited with polarized light to likewise emit polarized light. If, however, the fluoro- 
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phore is allowed to rotate during the excited state, the polarization of the fluores- 
cent light which is emitted is more or less lost. Under otherwise identical condi- 
tions (for example temperature, viscosity, solvent), the rotation is a function of 
molecule size, whereby findings regarding the size of the fluorophore-bound resi- 
due can be obtained via the reading (Methods in Enzymology 246 (1995), pp. 
283-300). A method according to the invention can be designed directly for 
measuring the binding of a test compound labeled with a fluorescent molecule to 
the CLP. As an alternative, the method according to the invention may also take 
the form of the "displacement assay" described under 1 . 

3. Fluorescence resonance energy'transfer (FRET) is based on the irradiation-free 
energy transfer between two spatially adjacent fluorescent molecules under suit- 
able conditions. A prerequisite is that the emission spectrum of the donor mole- 
cule overlaps with the excitation spectrum of the acceptor molecule. The fluores- 
cent label of CLP, and binding test compound, the binding can be measured by 
means of FRET (Cytometry 34, 1998, pp. 159-179). As an alternative, the 
method according to the invention may also take the form of the "displacement 
assay" described under 1 . An especially suitable embodiment of FRET technol- 
ogy is "Homogeneous Time Resolved Fluorescence" (HTRF) as can be obtained 
from Packard Bioscience. 

4. Surface-enhanced laser desorption/ionization (SELDI) in combination with a time- 
of-flight mass spectrometer (MALDI-TOF) makes possible the rapid analysis of 
molecules on a support and can be used for analyzing protein/ligand interactions 
(Worral et al., (1998) Anal. Biochem. 70:750-756). In a preferred embodiment, 
CLP, is immobilized on a suitable support and incubated with the test compound. 
After one or more suitable wash steps, the test compound molecules which are 
additionally bound to CLP, can be detected by means of the abovementioned 
methodology and test compounds which are bound to CLP, can thus be selected. 

5. The measurement of surface plasmon resonance is based on the change in the 
refractive index at a surface when a test compound binds to a protein which is 
immobilized to said surface. Since the change in the refractive index is identical 
for virtually all proteins and polypeptides for a defined change in the mass con- 
centration at the surface, this method can be applied to any protein in principle 
(Lindberg et al. Sensor Actuators 4 (1983) 299-304; Malmquist Nature 361 
(1993) 186-187). The measurement can be carried out for example with the 
automatic analyzer based on surface plasmon resonance which is available from 
Biacore (Freiburg) at a throughput of, currently, up to 384 samples per day. A 
method according to the invention can be designed directly for measuring the 
binding of a test compound to CLP. As an alternative, the method according to 
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the invention may also take the form of the "displacement assay" described under 
1. 

The compounds identified via the abovementioned methods 1 to 5 may be suitable as 
5 inhibitors. All of the substances identified via the abovementioned methods can subse-* 
quently be checked for their herbicidal action in another embodiment of the method 
according to the invention. 

Furthermore, there exists the possibility of detecting further candidates for herbicidal 
10 active ingredients by molecular modeling via elucidation of the three-dimensional struc- 
ture of CLP, by x-ray structure analysis. The preparation of protein crystals required for 
x-ray structure analysis, and the relevant measurements and subsequent evaluations 
of these measurements, the detection of a binding site in the protein, and the prediction 
of potential inhibitor structures are known to the skilled worker. In principle, an optimi- 
15 zation of the compound identified by the abovementioned methods is also possible via 
molecular modeling. 

A preferred embodiment of the method according to the invention, which is based on 
steps i) and ii), consists in selecting a test compound which reduces or blocks the activ- 
20 ity of the CLP. Preferably, the activity of the CLP, incubated with the test compound is 
herein compared with the activity of a CLP, not incubated with a test compound. 

A more preferred embodiment of the method based on steps i) and ii) consists in 

25 i. expressing CLP in a transgenic organism according to the invention or growing 
an organism which naturally contains a CLP, 

ii. bringing CLP, of step i) in the cell digest of the transgenic or nontransgenic or- 
ganism, in partially purified or in homogeneously purified form, into contact with a 

30 test compound; and 

iii. selecting a compound which reduces or blocks the activity of the nuclear en- 
coded Clp Protease. Preferably the activity of CLP incubated with the test com- 
pound is herein compared with the activity of a CLP, not incubated with a test 

35 compound. 

The solution containing the CLP, can consist of the lysate of the original organism or of 
the transgenic organism which has been transformed with an expression cassette ac- 
cording to the invention. If necessary, the CLP, can be purified partially or fully via cus- 
40 tomary methods. A general overview over current protein purification techniques is de- 
scribed, for example, in Ausubel, F.M. et al., Current Protocols in Molecular Biology, 
Greene Publishing Assoc. and Wiley-lnterscience (1994); ISBN 0-87969-309-6. In the 
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case of recombinant preparation, the protein which has been fused with an affinity tag 
can be purified via affinity chromatography as is known to the skilled worker. 

The CLP, which is required for in vitro methods can thus be isolated either by means of 
5 heterologous expression from a transgenic organism according to the invention or from 
an organism containing CLP, for example from an undesired plant, the term "undesired 
plant" being understood as meaning the species mentioned at the outset. 

To identify herbicidal compounds, the CLP, is now incubated with a test compound. 

10 After a reaction time, the enzymatic activity of the CLP, incubated with the test com- 
pound is determined in comparison with a CLP, not incubated with a test compound. If 
the CLP, is inhibited, a significant decrease in activity in comparison with the activity of 
the noninhibited polypeptide according to the invention is observed, the result being a 
reduction of at least 10%, advantageously at least 20%, preferably at least 30%, espe- 

15 daily preferably by at least 50%, up to 100% reduction (blocking). Preferred is an inhi- 
bition of at least 50% at test compound concentrations of lO^M, preferably at 10" 5 M, 
especially preferably of lO^M, based on enzyme concentration in the micromolar 
range. 



20 The enzymatic activity of CLP, can be determined for example by an activity assay in 
which the increase of the product, the decrease of the substrate (or starting material) or 
the decrease or increase of the cofactor are determined, or by a combination of at least 
two of the abovementioned parameters, as a function of a defined period of time. 

25 Examples of suitable substrates are, for example small peptides and modified small 

peptides like peptides coupled to a fluorogenic molecule such as aminomethylcoumarin 
and succinylated peptides. 

If appropriate, derivatives of the abovementioned compounds which contain a detect- 
30 able label such as, for example, a fluorescent label (e.g. fluorogenic substrates such as 
N-Suc-Leu-Tyr-(7-amino-4-methylcoumarine) (SLT-AMC), Suc-AIa-Ala-Ala-AMC, Suc- 
Leu-Leu-Val-Tyr-AMC, Suc-Ala-Ala-Phe-AMC, Suc-lle-lle-Trp-AMC, Suc-Ala-Phe-Lys- 
AMC), a radioisotope label or a chemiluminescent label, may also be used. 

35 The amounts of substrates to be employed in the activity tests may range between 0.5 
and 100 mM, based on 1-100 pg/ml enzyme. 
[ 

The activity can be determined for example by tracking Proteolysis fluorimetrically 
when using fluorogenic Peptide substrates analogously to the method described by 
40 Woo et al. 1989 The Journal of Biological Chemistry 264, pp.2088-2091 , which is 
herein incorporated by reference. 
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The activity may also be determined in an ATP-dependent fashion in the presence of 
C|pA, CIpB or CIpC Protein as described in Hafperin et al. 2001, Planta 213, pp. 614- 
619. The preferred Substrate is then b-casein. 

5 Furthermore the activity may be measured by HPLC and HPLC-MS mehtods detecting 
-fragments of the peptides used as substrates. 

Another preferred embodiment of the method according to the invention which is based 
on steps i) and iii) consists of the following steps: 

10 

generating a transgenic organism according to the invention comprising a nucleic 
acid sequence according to the invention, wherein CLP is expressed recombi- 
nant^; 

applying a test compound to the transgenic organism of i) and to a nontransgenic 
organism of the same species; 

determining the growth or the viability of the transgenic and the nontransgenic 
organisms after application of the test substance, and 

selecting test compounds which bring about a reduced growth or a limited viabil- 
ity of the nontransgenic organism in comparison with the growth of the transgenic 
organism. 

25 In this context, the difference in growth in step iv) for the selection of a herbicidally ac- 
tive inhibitor amounts to at least 10%, by preference 20%, preferably 30%, especially 
preferably 40% and very especially preferably 50%. 

The transgenic organism in this context is preferably a plant, an alga, a cyanobacte- 
30 rium, for example of the genus Synechocystis or a proteobacterium such as, for exam- 
ple, Magnetococcus sp. MC1, preferably plants which can be transformed by means of 
customary techniques, such as Arabidopsis thaliana Allium cepa, Ananas comosus, 
Arachis hypogaea, Asparagus officinalis, Beta vulgaris spec, altissima, Beta vulgaris 
spec, rapa, Brassica napus var. napus, Brassica napus var. napobrassica, Brassica 
35 rapa var. silvestris, Camellia sinensis, Carthamus tinctorius, Carya illinoinensis, Citrus 
limon, Citrus sinensis, Coffea arabica (Coffea canephora, Coffea liberica), Cucumis 
sativus, Cynodon dactylon, Daucus carota, Elaeis guineensis, Fragaria vesca, Glycine 
max, Gossypium hirsutum, (Gossypium arboreum, Gossypium herbaceum, Gossypium 
vitifolium), Helianthus annuus, Hevea brasiliensis, Hordeum vulgare, Humulus lupulus, 
40 Ipomoea batatas, Juglans regia, Lens culinaris, Linum usitatissimum, Lycopersicon 

lycopersicum, Malus spec, Manihot esculenta, Medicago sativa, Musa spec, Nicotiana 
tabacum (N.rustica), Olea europaea, Oryza sativa, Phaseolus lunatus, Phaseolus vul- 



^ 15 ii. 

iii. 

20 

iv. 
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garis, Picea abies, Pinus spec, Pisum sativum, Pnjnus avium, Painus persica, Pyrus 
communis, Ribes sylvestre, Ricinus communis, Saccharum officinarum, Secale ce- 
reale, Solanum tuberosum, Sorghum bicolor (s. vulgare), Theobroma cacao, Trifolium 
pratense, Triticum aestivum, Triticum durum, Vicia faba, Vitis vinifera, Zea mays, or 
5 cyanobacteria which can be transformed readily, such as Synechocystis, into which the 
sequence encoding a polypeptide according to the invention has been incorporated by 
transformation. These transgenic organisms thus show increased tolerance to com- 
pounds which inhibit the polypeptide according to the invention. "Knock-out" mutants in 
which the analogous CLProtease gene which is naturally present in this organism has 
10 been selectively switched off may also be used. 

However, the abovementioned embodiment of the method according to the invention 
can also be used for identifying substances with a growth-regulatory action. In this con- 
text, the transgenic organism employed is a plant. The method for identifying sub- 
15 stances with growth-regulatory activity thus comprises the following steps: 

i. generating a transgenic plant comprising a nucleic acid sequence according to 
the invention encoding CLP, wherein CLP is expressed recombinant^; 

20 ii. applying a test substance to the transgenic plant of i) and to a nontransgenic 
plant of the same variety, 



iii. determining the growth or the viability of the transgenic plant and the nontrans- 
genic plant after application of the test compound, and 

25 

iv. selecting test substances which bring about a reduced growth of the nontrans- 
genic plant in comparison with the growth of the transgenic plant. 

Here, step iv) involves the selection of test compounds which bring about a modified 
30 growth of the nontransgenic organism in comparison with the growth of the transgenic 
organism. Modified growth is understood as meaning, in this context, inhibition of the 
vegetative growth of the plants, which can manifest itself in particular in reduced longi- 
tudinal growth. Accordingly, the treated plants show stunted growth; moreover, their 
leaves are darker in color. In addition, modified growth is also understood as meaning 
35 a change in the course of maturation over time, the inhibition or promotion of lateral 

branched growth of the plants, shortened or extended developmental stages, increased 
standing ability, the growth of larger amounts of buds, flowers, leaves, fruits, seed ker- 
nels, roots and tubers, an increased sugar content in plants such as sugarbeet, sugar 
cane and citrus fruit, an increased protein content in plants such as cereals or soybean, 
40 or stimulation of the latex flow in rubber trees. The skilled worker is familiar with the 
detection of such modified growth. 
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.It is also possible, in the method according to the invention, to employ a plurality of test 
compounds in a method according to the invention. If a group of test compounds affect 
the target, then it is either possible directly to isolate the individual test compounds or 
to divide the group of test compounds into a variety of subgroups, for example when it 
5 consists of a multiplicity of different components, in order to thus reduce the number of 
the different test compounds in the method according to the invention. The method 
according to the invention is then repeated with the individual test compound or the 
relevant subgroup of test compounds. Depending on the complexity of the sample, the 
above-described steps can be carried out repeatedly, preferably until the subgroup 
10 identified in accordance with the method according to the invention only comprises a 
small number of test compounds, or indeed just one test compound. 

All of the above-described methods for identifying inhibitors with herbicidal or growth- 
regulatory activity are hereinbelow referred to as "methods according to the invention". 

15 

All of the compounds which have been identified via the methods according to the in- 
vention can subsequently be tested in vivo for their herbicidal and growth-regulatory 
activity. One possibility of testing the compounds for herbicidal action is to use duck- 
weed, Lemna minor, in microtiter plates. Parameters which can be measured are 
20 changes in the chlorophyll content and the photosynthesis rate. It is also possible to 
apply the compound directly to undesired plants, it being possible to identify the herbi- 
cidal action for example via restricted growth. 

The method according to the invention can advantageously also be carried out in high- 
25 throughput methods, known as HTS, which makes possible the simultaneous testing of 
a multiplicity of different compounds. 

The use of supports which contain one or more of the nucleic acid molecules according 
to the invention, one or more of the vectors containing the nucleic acid sequence ac- 
30 cording to the invention, one or more transgenic organisms containing at least one of 
the nucleic acid sequences according to the invention or one or more (poly)peptides 
encoded via the nucleic acid sequences according to the invention lends itself to carry- 
ing out HTS in practice. v 

35 Supports which contain one or more of the NCLP sequences, one or more of the vec- 
tors comprising the NCLP sequences one or more transgenic organisms containing at 
least one NCLP sequences or one or more (poly)peptides encoded by the NCLP se- 
quences are part of the present invention. 

40 The support used can be solid or liquid, but is preferably solid and especially preferably 
a microtiter plate. The abovementioned supports also form part of the subject matter of 
the present invention. In accordance with the most widely used technique, 96-well, 
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384-well and 1536-well microtiter plates which, as a rule, can comprise volumes of 200 
□I, are used. Besides the microtiter plates, the further components of an HTS system 
which match the corresponding microtiter plates, such as a large number of instru- 
ments, materials, automatic pipetting devices, robots, automated plate readers and 
5 plate washers, are commercially available. 

In addition to the HTS systems based on microtiter plates, what are known as "free- 
format assays" or assay systems where no physical barriers exist between the sam- 
ples, as described, for example, in Jayaickreme et al., Proc. Natl. Acad. Sci U.S.A. 19 
10 (1994) 161418; Chelsky, "Strategies for Screening Combinatorial Libraries", First An- 
nual Conference of The Society for Biomolecular Screening in Philadelphia, Pa. (Nov. 
710, 1995)^ Salmon et al., Molecular Diversity 2 (1996), 5763 and US 5,976,813, may 
also be used. 

m ■ . • 

15 The invention furthermore relates to herbicidally active compounds identified by the 
methods according to the invention. These compounds are hereinbelow referred to as 
"selected compounds". They have a molecular weight of less than 1000 g/mol, advan- 
tageously less than 500 g/mol, preferably less than 400 g/mol, especially preferably 
less than 300 g/mol. Herbicidally active compounds have a Ki value of less than 1 mM, 

20 preferably less than 1 pM, especially preferably less than 0.1 (jM, very especially pref- 
erably less than 0.01 pM. 

Examples for herbicidally active compounds identified with the above mentioned HTS 
methods are the compounds of the formula: 

25 




ffc Y + ~ formula (I) 

30 or 
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The invention furthermore relates to compounds with growth-regulatory activity identi- 
fied by the methods according to the invention. These compounds too are hereinbelow 
referred to as "selected compounds". . . 

Naturally, the selected compounds can also be present in the form of their agriculturally 
useful salts. Agriculturally useful salts which are suitable are mainly the salts of those 
cations, or the acid addition salts of those acids, whose cations, or anions, do not ad- 
versely affect the herbicidal action of the herbicidally active compounds identified via 
the methods according to the invention. 

If the selected compounds contain asymmetrically substituted O-carbon atoms, they 
may furthermore also be present in the form of racemates, enantiomer mixtures, pure 
enantiomers or, if they have chiral substituents, also in the form of diastereomer mix- 
tures. 

The selected compounds can be chemically synthesized substances or substances 
produced by microbes and can be found, for example, in cell extracts of, for example, 
plants, animals or microorganisms. The reaction mixture can be a cell-free extract or 
comprise a cell or cell culture. Suitable methods are known to the skilled worker and 
are described generally for example in Alberts, Molecular Biology the cell, 3rd Edition 
(1994), for example chapter 17. The selected compounds may also originate from 
comprehensive substance libraries. 

Candidate test compounds can be expression libraries such as, for example, cDNA 
expression libraries, peptides, proteins, nucleic acids, antibodies, small organic sub- 
stances, hormones, PNAs or the like (Milner, Nature Medicin 1 (1995), 879-880; Hupp, 
Cell. 83 (1995), 237-245; Gibbs, Cell. 79 (1994), 193-198 and references cited 
therein). 

The selected compounds can be used for controlling undesired vegetation and/or as 
growth regulators. Herbicidal compositions comprising the selected compounds afford 
very good control of vegetation on noncrop areas. In crops such as wheat, rice, maize, 
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soybean and cotton, they act against broad-leaved weeds and grass weeds without 
inflicting any significant damage on the crop plants. This effect is observed in particular 
at low application rates. The selected compounds can be used for controlling the harm- 
ful plants which have already been mentioned above. 

5 

Depending on the application method in question, selected compounds, or herbicidal 
compositions comprising them, can advantageously also be employed in a further 
number of crop plants for eliminating undesired plants. Examples of suitable crops are: 

10 Allium cepa, Ananas comosus, Arachis hypogaea, Asparagus officinalis, Beta vulgaris 
spec, altissima, Beta vulgaris spec, rapa, Brassica napus var. napus, Brassica napus 
var. napobrassica, Brassica rapa var. silvestris, Camellia sinensis, Carthamus tincto- 
rius, Carya illinoinensis, Citrus limon, Citrus sinensis, Coffea arabica (Coffea can- 
ephora, Coffea liberica), Cucumis sativus, Cynodon dactylon, Daucus carota, Elaeis 

15 guineensis, Fragaria vesca, Glycine max, Gossypium hirsutum, (Gossypium arboreum, 
Gossypium herbaceum, Gossypium vitifolium),,Helianthus annuus, Hevea brasiliensis, 
Hordeum vulgare, Humulus lupulus, Ipomoea batatas, Juglans regia, Lens culinaris, 
Linum usitatissimum, Lycopersicon lycopersicum, Malus spec, Manihot esculenta, 
Medicago sativa, Musa spec, Nicotiana tabacum (N.rustica), Olea europaea, Oryza 

20 sativa, Phaseolus lunatus, Phaseolus vulgaris, Picea abies, Pinus spec, Pisum sati- 
vum, Prunus avium, Prunus persica, Pyrus communis, Ribes sylvestre, Ricinus com- 
munis, Saccharum officinarum, Secale cereale, Solanum tuberosum, Sorghum bicolor 
(s. vulgare), Theobroma cacao, Trifolium pratense, Triticum aestivum, Triticum durum, 
Vicia faba, Vitis vinifera, Zea mays. 

25 

In addition, the selected compounds can also be used in crops which tolerate the ac- 
tion of herbicides owing to breeding, including recombinant methods. The generation of 
such crops is described hereinbelow. 

30 The invention furthermore relates to a method of preparing the herbicidal or growth- 
regulatory composition which has already been mentioned above, which comprises 
formulating selected compounds with suitable auxiliaries to give crop protection prod- 
ucts. 

35 The selected compounds can be formulated for example in the form of directly spray- 
able aqueous solutions, powders, suspensions, also highly concentrated aqueous, oily 
or other suspensions or suspoemulsions or dispersions, emulsifiable concentrates, 
emulsions, oil dispersions, pastes, dusts, materials for spreading or granules, and ap- 
plied by means of spraying, atomizing, dusting, spreading or pouring. The use forms 

40 depend on the intended use and the nature of the selected compounds; in any case, 
they should guarantee the finest possible distribution of the selected compounds. The 
herbicidal compositions comprise a herbicidally active amount of at least one selected 
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compound and auxiliaries conventionally used in the formulation of herbicidal composi- 
tions. 

For the preparation of emulsions, pastes or aqueous or oily formulations and dispersi- 
5 ble concentrates (DC), the selected compounds can be dissolved or dispersed in an oil 
or solvent, it being possible to add further formulation auxiliaries for homogenization. 
However, it is also possible to prepare liquid or solid concentrates from selected com- 
pound, if appropriate solvents or oil and, optionally, further auxiliaries comprising liquid 
or solid concentrates, and these concentrates are suitable for dilution with water. The 

10 following can be mentioned: emulsifiable concentrates (EC, EW), suspensions (SC), 
soluble concentrates (SL), dispersible concentrates (DC), pastes, pills, wettable pow- 
ders or granules, it being possible for the solid formulations either to be soluble or dis- 
persible (wettable) in water. In addition, suitable powders or granules or tablets can 
additionally be provided with a solid coating which prevents abrasion or premature re- 

1 5 lease of the active ingredient. 

In principle, the term "auxiliaries" is understood as meaning the following classes of 
compounds: antifoams, thickeners, wetting agents, tackifiers, dispersants, emulsifiers, 
bactericides and/or thixotropic agents. The skilled worker is familiar with the meaning of 
20 the abovementioned agents. 

SLs, EWs and ECs can be prepared by simply mixing the ingredients in question; pow- 
ders can be prepared by mixing or grinding in specific types of mills (for example ham- 
mer mills). DCs, SCs and SEs are usually prepared by wet milling, it being possible to 

25 prepare an SE from an SC by addition of an organic phase which may comprise further 
auxiliaries or selected compounds. The preparation is known. Powders, materials for 
spreading and dusts can advantageously be prepared by mixing or cogrinding the ac- 
tive substances together with a solid carrier. Granules, for example coated granules, 
impregnated granules and homogeneous granules, can be prepared by binding the 

30 selected compounds to solid carriers. The skilled worker is familiar with further details 
regarding their preparation, which are mentioned for example in the following publica- 
tions: US 3,060,084, EP-A 707445 (for liquid concentrates), Browning, "Agglomera- 
tion", Chemical Engineering, Dec. 4, 1967, 147-48, Perry's Chemical Engineer's Hand- 
book, 4th Ed., McGraw-Hill, New York, 1963, pages 8-57 and et seq. WO 91/13546, 

35 US 4,172,714, US 4,144,050, US 3,920,442, US 5,180,587, US 5,232,701, US 

5,208,030, GB 2,095,558, US 3,299,566, Klingman, Weed Control as a Science, John 
Wiley and Sons, Inc., New York, 1961, Hance et al., Weed Control Handbook, 8th Ed., 
Blackwell Scientific Publications, Oxford, 1989 and Mollet, H., Grubemann, A., Formu- 
lation technology, Wiley VCH Verlag GmbH, Weinheim (Federal Republic of Germany), 

40 2001. 
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The skilled worker is familiar with a multiplicity of inert liquid and/or solid carriers which 
are suitable for the formulations according to the invention, such as, for example, liquid 
additives such as mineral oil fractions of medium to high boiling point such as kerosene 
or diesel oil, furthermore coal tar oils and oils of vegetable or animal origin, aliphatic, 
5 cyclic and aromatic hydrocarbons, for example paraffin, tetrahydrophthalene, alkylated * 
naphthalenes or their derivatives, alkylated benzenes or their derivatives, alcohols such 
as methanol, ethanol, propanol, butanol and cyclohexanol, ketones such as cyclohexa- 
none, or strongly polar solvents, for example amines such as N-methylpyrrolidone or 
water. 

10 

Examples of solid carriers are mineral 'earths such as silicas, silica gels, silicates, talc, 
kaolin, limestone, lime, chalk, bole, loess, clay, dolomite, diatomaceous earth, calcium 
sulfate, magnesium sulfate, magnesium oxide, ground synthetic materials, fertilizers 
such as ammonium sulfate, arhmonium phosphate, ammonium nitrate, ureas and 
15 products of vegetable origin such as cereal meal, tree bark meal, wood meal and nut- 
shell meal, cellulose powders or other solid carriers. 

The skilled worker is familiar with the multiplicity of surface-active substances (surfac- 
tants) which are suitable for the formulations according to the invention such as, for 

20 example, alkali metal salts, alkaline earth metal salts or ammonium salts of aromatic 
sulfonic acids for example lignosulfonic acid, phenolsulfonic acid, naphthalenesulfonic 
acid, and dibutylnaphthalenesulfonic acid, and of fatty acids, of alkyl- and alkylarylsut- 
fonates, of alkyl sulfates, lauryl ether sulfates and fatty alcohol sulfates, and salts of 
sulfated hexa-, hepta- and octadecanols and of fatty alcohol glycol ethers, condensates 

25 of sulfonated naphthalene and its derivatives with formaldehyde, condensates of naph- 
thalene or of the naphthalenesulfonic acids with phenol and formaldehyde, poly- 
oxyethylene octylphenol ether, ethoxylated isooctyl-, octyl- or nonylphenot, alkylphenyl 
polyglycol ethers, tributylphenyl polyglycol ether, alkylaryl polyether alcohols, isotridecyl 
alcohol, fatty alcohol/ethylene oxide condensates, ethoxylated caster oil, polyoxyethyl- 

30 ene alkyl ethers or polyoxypropylene alkyl ethers, lauryl alcohol polyglycol ether ace- 
tate, sorbitol esters, lignosulfite waste liquors or methylcellulose. 

The herbicidal compositions, or the selected compounds, can be applied pre- or post- 
emergence. If the selected compounds are less well tolerated by certain crop plants, 
35 application techniques may be used in which the selected compounds are sprayed, 
with the aid of the spraying apparatus, in such a way that they come into as little con- 
tact, if any, with the leaves of the sensitive crop plants while the selected compounds 
reach the leaves of undesired plants which grow underneath, or the bare soil surface 
(post-directed, lay-by). 

40 



PF 55117 



45 

Depending on the intended purpose of the control measures, the season, the target 
plants and the growth stage, the application rates of selected compounds amount to 
0.001 to 3.0, preferably 0.01 to 1.0 kg/ha. 

5 The invention is illustrated in greater detail by the examples which follow, which are not 
to be considered as limiting. 

General DNA manipulation and cloning methods 

10 Cloning methods such as, for example, restriction cleavages, agarose gel electropho- 
reses, purification of DNA fragments, transfer of nucleic acids to nitrocellulose and ny- 
lon membranes, linking DNA fragments, transformation of Escherichia coli cells, grow- 
ing bacterium and sequence analyses of recombinant DNA were carried out as de- 
scribed by Sambrook et al. (1989) (Cold Spring Harbor Laboratory Press: ISBN 0- 

15 87969-309-6) and Ausubel, F.M. et al., Current Protocols in Molecular Biology, Greene 
Publishing Assoc. and Wiley-lnterscience (1994); ISBN 0-87969-309-6. 

Molecular-biological standard methods for plants and plant transformation methods are 
described in Schultz et al., Plant Molecular Biology Manual, Kluwer Academic Publish- 
20 ers (1998), Reither et al., Methods in Arabidopsis Research, World scientific press 
(1992) and Arabidopsis: A Laboratory Manual (2001), ISBN 0-87969-573-0. 

The bacterial strains used hereinbelow (E. coli DH5, XL-1 blue) were obtained from 
Stratagene, BRL Gibco or Invitrogen, Carlsberg, CA. The vectors used for cloning were 
25 pUC 19 from Amersham Pharmacia (Freiburg) and the vector pBinAR (Hofgen and 
Willmitzer, Plant Science 66, 1990, 221-230). 

t 

Example 1: Generation of a cDNA library in the plant transformation vector 

30 To generate a cDNA library (hereinbelow termed "binary cDNA library") in a vector 
which can be used directly for transforming plants, mRNA was isolated from a variety 
of plant tissues and transcribed into double-stranded cDNA using the cDNA Synthese 
Kit (Amersham Pharmacia Biotech, Freiburg). The cDNA first-strand synthesis was 
carried out using T12-18 oligonucleotides following the manufacturer's instructions. 

35 After size fractionation and the ligation of EcoRI-Notl adapters following the manufac- 
turer's instructions and filling up the overhangs with Pfu DNA polymerase (Stratagene), 
the cDNA population was normalized. The method of Kohci et al, 1995, Plant Journal 8, 
771-776 was followed, the cDNA being amplified by PCR with the oligonucleotide N1 
under the conditions given in Table 1 . 

40 
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Table 1 
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94 




1 


94 
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CO 
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72 
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y4 


p 
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50 


60 




72 


180 




94 


8 


10 


48 


60 




72 


180 




72 


420 


1 



5 The resulting PCR product was bound to the column matrix of the PCR purification kit 
(Qiagen, Hilden) and eluted with 300 mM NaP buffer, pH 7.0, 0.5 mM EDTA, 0.04% 
SDS. The DNA was denatured for 5 minutes in a boiling water bath and subsequently 
renatured for 24 hours at 60oC. 50\i\ of the DNA were applied to a hydroxylapatite col- 
umn and the column was washed 3 times with 1 ml of 10 mM NaP buffer, pH 6.8. The 
10 bound single-stranded DNA was eluted with 130 mM NaP buffer, pH 6.8, precipitated 
with ethanol and dissolved in 40 pi of water. 20 \i\ of growth were used for a further 
PCR amplification as described above. After further ssDNA concentration, a third PCR 
amplification was carried out as described above. 

15 The plant transformation vector for taking up the cDNA population which had been 
generated as described above was generated via restriction enzyme cleavage of the 
vector pUC18 with Sbfl and BamHI, purification of the vector fragment followed by fill- 
ing up the overhangs with Pfu DNA polymerase and relegation with T4 DNA ligase 
(Stratagene). The resulting construct is hereinbelow termed pUC18Sbfk 

20 

The vector pBinAR was first cleaved with Notl, the ends were filled up and the vector 
was relegated, cleaved with Sbfl, the ends were filled up and the vector was relegated 
and subsequently cleaved with EcoRI and Hindlll. The resulting fragment was ligated 
into a derivative of the binary plant transformation vector pPZP (Hajdukiewicz.P, Svab, 
25 Z, Maliga, P., (1994) Plant Mol Biol 25:989-994) which makes possible the transforma- 
tion of plants by means of agrobacterium and mediates kanamycin resistance in trans- 
genic plants. The construct generated thus is hereinbelow termed pSun12/35S. 
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pUC18Sbfl- was used as template in a polymerase chain reaction (PCR) with the oli- 
gonucleotides V1 and V2 (see Table 2) and Pfu DNA polymerase. The resulting frag- 
ment was ligated into the Smal-cut pSun12/35S, giving rise to pSunblues2. Following 
cleavage with Notl, dephosphorylation with shrimp alkaline phosphatase (Roche Diag- 
5 nostics, Mannheim) and purification of the vector fragment, pSunblues2 was ligated 
with the normalized, likewise Notl-cut cDNA population. Following transformation into 
E.coli XMblue (Stratagene), the resulting clones were deposited into microtiter plates. 
The binary cDNA library contains cDNAs in "sense"- and in "antisense" orientation un- 
der the control of the cauliflower mosaic virus 35S promoter, and, after transformation 
10 into tobacco plants, these cDNAs can, accordingly, lead to "cosuppression" and "an- 
tisense" effects. 

Table 2: Oligonucleotides used 



Oligonucleotide 


Nucleic acid sequence 


N1 


5'-AGAATTCGCGGCCGCT-3: (SEQ ID NO:23) 


V1 (PWL93not) 


5'-CTCATGCGGCCGCGCGCAACGCAATTAATGTG-3' (SEQ 
ID NO:24) 


V2 (pWL92) 


5'-TCATGCGGCCGCGAGATCCAGTTCGATGTAAC-3" (SEQ 
ID NO:25) 


G1 (35S) 


5-GTGGATTGATGTGATATCTCC-3' (SEQ ID NO:26) 


G2 (OCS) 


5'-GTAAGGATCTGAGCTACACAT-3' (SEQ ID NO:27) 



Example 2: Transformation and analysis of tobacco plants 

Selected clones of the binary cDNA library were transformed into Agrobacterium tume- 
20 faciens C58C1:pGV2260 and (Deblaere et al., Nucl. Acids. Res. 13(1984), 4777-4788) 
and incubated with Streptomycin/Spectinomycin selection. The material used for the 
transformation of tobacco plants (Nicotiana tabacum cv. Samsun NN) with one of the 
binary clones as depicted in table 3 was an overnight culture of a positively trans- 
formed agrobacterial colony diluted with YEB medium to OD600 = 0.8-1 .6. Leaf discs 
25 of sterile plants (approx. 1 cm2 each) were incubated for 5-10 minutes with a 1:50 
agrobacterial dilution in a Petri dish. This was followed by incubation in the dark for 2 
days at 25°C on Murashige-Skoog medium (Physiol. Plant. 15(1962), 473) supple- 
mented with 2% sucrose (2MS medium) and 0.8% Bacto agar. The cultivation was con- 
tinued after 2 days at a 16-hour-light/8-hour-darkness photoperiod and continued in a 
30 weekly rhythm on MS medium supplemented with 500mg/l Claforan (cefotaxime so- 
dium), 50mg/l kanamycin, 1mg/l benzylaminopurin (BAP), 0.2mg/l naphthylacetic acid 
and 1.6g/l glucose. Regenerated shoots were transferred onto an MS medium supple- 
mented with kanamycin and Claforan. Transgenic plants of lines as depicted in table 3 
were generated in this manner. 
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Table 3: Plant lines generated 



Partial cdna with pheno- 
type in transgenic tobacco 


Plant line 


Corresponding 
full length cDNA 


runction 


SEQ ID NO: 20 


E_0000013511 


SEQ ID NO: 3 


ClpP2 protease 


SEQIDNO:19 


E_0000008893 


SEQ ID NO: 11 


ClpP5=ClpR1 pro- 
tease 


SEQ ID NO:21 


E_0000012393 




ClpP6 protease 






SEQ ID NO: 17 


ClpP-like protease 



The integration of the clone cDNA into" the genome of the transgenic lines was detected 
5 via PGR with the oligonucleotides G1 and G2 (see Table 2) and genomic DNA pre- 
pared from the transgenic lines in question. To this end, TAKARA Taq DNA poly- 
merase was preferably employed for this purpose, following the manufacturer's instruc- 
tions (MoBiTec, Gottingen). The cDNA clone of the binary cDNA library, which clone 
had been used for the transformation, acted as template for a PGR reaction as the 
10 positive control. PCR products with an identical size or, if appropriate, identical cleav- 
age patterns which were obtained after cleavage with a variety of restriction enzymes 
acted as proof that the corresponding cDNA had been integrated. In this manner, the 
insert of clones were detected in the respective transgenic plant lines (as depicted in 
table 3) with the belowmentioned phenotypes. 

15 

After the shoots had been transferred into soil, the plants were observed for 2-20 
weeks in the greenhouse for the manifestation of phenotypes. It emerged that trans- 
genic plants of lines EJ3000012393, E_000001351 1 and E_0000008893 were similar 
in phenotype. The plants showed severe chlorosis and concomitant growth retardation 
20 with respect to wild type plants after 2 weeks. 

Example 3: Sequence analysis of the clones 

SEQ ID NO:19 was fully sequenced and used for the detection of the corresponding full 
25 length clone SEQ ID NO:1 1. SEQ ID NO:1 1 is identical to nt002050071r S EQ ID 

NO: 19 in the overlapping region. An open reading frame of 867 nt (pos. 2-1 162) en- 
codes for 387 amino acids (SEQ ID NO: 4) with highest identity to-ClpR1 from Arabi- 
dopsis thaliana-. Sequence homology suggests that the 5-ends of SEQ ID NO:1 1 and 
ClpR1 from Arabidopsis thaliana are very diverse and that nt006066004r is close to 
30 being full size with respect to ClpR1. MS-Analysis of isolated CIpP Proteins from Arabi- 
dopsis indicate, that the mature ClpR1 is several kDal shorter as suggested by the 
cDNA Sequence (Peltier et al. 2001, The Journal of Biological Chemistry 276, 
99.16318-16327). 
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SEQ ID NO:20 was fully sequenced and used for the detection of the corresponding full 
length clone, SEQ ID NO: 3r SEQ ID NO:3 is identical to SEQ ID NO.20 in the overlap- 
ping region. An open reading frame of 867 nt (pos. 11 -877) encodes for 289 amino 
acids (SEQ ID NO:4) with highest identity to ClpP2 from Arabidopsis thaliana. 

5 

SEQ ID NO:21 was fully sequenced . The partial cDNA Sequence of 602 nt contains an 
open reading frame of 186 nt (nt 8-193) encoding for 62 amico acids (SEQ ID NO:22). 
This partial polypeptide shows highest identity to ClpP6 from Arabidopsis thaliana 
(SEQ ID NO:9) 

10 

A further ClpP-homolog cDNA of 906 nt (SEQ ID NO: 17) was identified. An open read- 
ing frame of 71 1 nt (pos. 45-755) encodes for 237 amino acids (SEQ ID NO: 18) with 
highest identity to a ClpP-like protein from Arabidopsis thaliana (GeneBank Acc. No. 
AK1 18523). 

15 

Thus, it was shown for the first time and in a surprising manner that the natural expres- 
sion of nuclear encoded Clp protease encoding genes is essential for plants and that 
reduced expression leads to damage as depicted by the phenotypes mentioned in Ex- 
ample 2 demonstrating the suitability of nuclear encoded Clp-proteases as target for 
20 herbicides. 

Example 4: Expression in E.coli 

In order to generate active protein with nuclear encoded Clp-protease activity frag- 
ments of SEQ ID NO:1 1 , -SEQ ID NO:3 and of SEQ ID NO:9 were subcloned into the 
expression vector pQE60 (Quiagen, Hilden, Germany). To this end the oligonucleotides 
displayed in tab. 4 where used to amplify via polymerase chain reaction cDNA frag- 
ments that contain Ncol and Bglll restriction sites. The PCR was carried out in 36 cy- 
cles following standard conditions (for example as described by Sambrook, J. et al. 
(1989) "Molecular cloning: A laboratory manual", Cold Spring Harbor Laboratory 
Press), the annealing temperatures being between 45 and 55°C and the polymerization 
time being in each case 60 seconds per 1000bp. Cutting the cDNA fragments with Ncol 
and Bglll restriction enzymes and ligation into pQE60 cut with the same enzymes de- 
livered expression plasmids that were transformed into E. coli. Expression was per- 
formed in E. coli TOP 10F strains (Invitrogen, Karlsruhe, Germany) following induction 
with IPTG. Standard protocols (Invitrogen) were followed. 

Enzyme preperations were achieved by breaking cells in a French-Press in 100mM 
Tris/HCI, pH 7.4, 2.5 mM EDTA, 1% Triton X-100. 

40 

The expression products were purified by affinity chromatography on Ni-agarose where 
appropriate. The manufacturer's instructions were followed (Qiagen). 
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Table 4 





Primer /Mi if*loif* £Jr"*iH com tonrp^ 


NT_ClpR1 , > 


5'-TATACCATGGATTTGCCATCTTTG-3' (SEQ ID NO:28) 
5'-ATAGATCTCACCTGGAGCCAG-3' (SEQ ID NO:29) 


Nt_ClpP2 1) 


5-GAGCCCATGGCAAGAGGAG -3' (SEQ ID NO:30) 
5'-ATAGATCTTTCTAGCTTGAACC-3' (SEQ ID NO:31) 


AT_ClpP6 2) 


5-TCAGCCATGGCCCCTGGAGGAC -3'(SEQ ID NO:32) 
5'-TAAGATCTTCAGTATTCTGTTTCC-3 ' (SEQ ID NO:33) 



1) Template: Nicotiana tabacum cDNA library 
5 2) Template: Arabidopsis thaliana cDNA library 

Example 5: Activity assay 

Isolated CIpP activity can be measured as described (Woo et al. 1989 The Journal of 
10 Biological Chemistry 264, pp.2088-2091) by using fluoregenic substrates such as N- 
Suc-Leu-Tyr-(7-amino-4-methylcoumarine) (SLT-AMC). The proteolytic cleavage delib- 
erates 7-amino-4-methylcoumarin, which can be detected fluorimetrically (emission at 
460nm by exitation at 390 nm). 

Standard assays contain: 50mM Tris/HCI, pH 8.0, 25mM MgCI 2 , 1mM SLT-AMC and 1- 
15 100 pg CIpP Enzyme. 

The assay is suitable in for high throughput screening in 96well and 384 well format. 

Screening according to the above mentioned assay provided the following compounds 
20 of the formula: 




formula (I) 



formula (II) 
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CH 3 




formula (III). 



showing a inhibition of the enzyme of: 1 



compound of formula 


IC50 


(") 


2.3E-05 


(ID 


1.9E-05 

15 — 


(HI) 


2.5E-05 



20 



25 



Sequence Listing 



Sequence 



Function Organism 



SEQ ID NO:1 (nucleic acid sequence) ClpP1 Arabidopsis thanliana 

SEQ ID NO:2 (amino acid sequence) ClpP1 Arabidopsis thanliana 

30 SEQ ID NO:3 (nucleic acid sequence) ClpP2 Nicotiana tabacuum 

SEQ ID NO:4 (amino acid sequence) ClpP2 Nicotiana tabacuum 



35 



SEQ ID NO:5 (nucleic acid sequence) ClpP3 Arabidopsis thanliana 

SEQ ID NO:6 (amino acid sequence) ClpP3 Arabidopsis thanliana 

SEQ ID NO:7 (nucleic acid sequence) ClpP4 Arabidopsis thanliana 

40 SEQ ID NO:8 (amino acid sequence) ClpP4 Arabidopsis thanliana 

SEQ ID NO:9 (nucleic acid sequence) ClpP6 Arabidopsis thanliana 
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SEQ ID NO:10 (amino acid sequence) ClpP6 Arabidopsis thanliana 

SEQ ID NO:1 1 (nucleic acid sequence) ClpR1 Nicotiana tabacuum 

5 

SEQ ID NO: 12 (amino acid sequence) ClpR1 Nicotiana tabacuum 

SEQ ID NO: 13 (nucleic acid sequence) ClpR3 Arabidopsis thanliana 

10 SEQ ID NO: 14 (amino acid sequence) ClpR3 Arabidopsis thanliana 

SEQ ID NO: 15 (nucleic acid sequence) ClpR4 Arabidopsis thanliana 



15 



SEQ ID NO: 16 (amino acid sequence) ClpR4 Arabidopsis thanliana 
SEQ ID NO: 17 (nucleic acid sequence) CIpP like Arabidopsis thanliana 
SEQ ID NO: 18 (amino acid sequence) CIpP like Arabidopsis thanliana 



20 SEQ ID NO: 19 (nucleic acid sequence) ClpR1 Nicotiana tabacuum 
(fragment) 

SEQ ID NO:20 (nucleic acid sequence) ClpP2 Nicotiana tabacuum 
(fragment) 

25 

SEQ ID NO:21 (nucleic acid sequence) ClpP6 Nicotiana tabacuum 
(fragment) 

SEQ ID NO:22 (amino acid sequence) ClpP6 Nicotiana tabacuum 
30 (fragment) 

SEQ ID NO:23-33: Primer (nucleic acid sequences) 



